US 12,254,870 B2
Acoustic-based linguistically-driven automated text formatting
Julie A. Van Dyke, New Haven, CT (US); Michael Gorman, Edina, MN (US); and Mark Lacek, Minneapolis, MN (US)
Assigned to Cascade Reading, Inc., Edina, MN (US)
Appl. No. 18/033,243
Filed by Cascade Reading, Inc., Edina, MN (US)
PCT Filed Oct. 6, 2022; PCT No. PCT/US2022/045924; § 371(c)(1), (2) Date Apr. 21, 2023; PCT Pub. No. WO2023/059818; PCT Pub. Date Apr. 13, 2023.
Claims priority of provisional application 63/262,166, filed on Oct. 6, 2021.
Prior Publication US 2024/0257802 A1, Aug. 1, 2024
Int. Cl. G06F 40/103 (2020.01); G10L 13/02 (2013.01); G10L 15/02 (2006.01); G10L 15/05 (2013.01); G10L 15/18 (2013.01); G10L 25/90 (2013.01); G10L 25/93 (2013.01)

CPC G10L 15/1822 (2013.01) [G10L 13/02 (2013.01); G10L 15/02 (2013.01); G10L 15/05 (2013.01); G10L 25/90 (2013.01); G10L 25/93 (2013.01)]

25 Claims

1. A method for generating a text arrangement from acoustic input, comprising:

obtaining an audio sample that includes multiple speech segments providing words from human speech;

identifying acoustic properties of the words from the audio sample using an acoustic analyzer provided by an audio processing service, wherein the acoustic properties are identified from an acoustic signal of the audio sample, and wherein the acoustic analyzer extracts at least one waveform from the acoustic signal;

computing respective measurements of the acoustic properties for each of the words, using the acoustic analyzer, based on direct measurement of the acoustic signal or comparisons within the acoustic signal captured from the at least one waveform;

generating an acoustic language model in a computer-readable data structure to represent a linguistic relationship of the words, based on the acoustic properties of the words from the audio sample, wherein the linguistic relationship of the words is determined from the respective measurements of the acoustic properties; and

outputting data to arrange the words into a cascade format, based on the linguistic relationship, wherein the cascade format establishes horizontal displacement and vertical displacement among the words.
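
The steps of claim 1 can be illustrated with a small Python sketch. It is a toy under stated assumptions, not the claimed or patented implementation: the claim does not name specific acoustic properties, thresholds, or data structures. Here the only acoustic measurement is a hypothetical per-word `pause_before_s` (silence before the word, assumed to have been measured already by some acoustic analyzer), and the names `WordMeasurement`, `build_lines`, `render_cascade`, `minor_pause_s`, and `major_pause_s` are all invented for illustration. The rule used (a short pause starts a new, deeper-indented line; a long pause starts a new line at the left margin) is a stand-in for the "linguistic relationship" of the claim.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class WordMeasurement:
    text: str
    pause_before_s: float  # hypothetical measurement: silence before the word, in seconds


def build_lines(words: List[WordMeasurement],
                minor_pause_s: float = 0.15,
                major_pause_s: float = 0.40) -> List[Tuple[int, str]]:
    """Group words into (indent_level, line_text) pairs using pause lengths.

    Assumed toy rule, not taken from the patent:
      pause >= major_pause_s  -> new line, indent reset to 0
      pause >= minor_pause_s  -> new line, indent one level deeper
      otherwise               -> word stays on the current line
    """
    lines: List[Tuple[int, List[str]]] = []
    indent = 0
    for w in words:
        if not lines:
            lines.append((0, [w.text]))          # first word opens the first line
        elif w.pause_before_s >= major_pause_s:
            indent = 0
            lines.append((indent, [w.text]))
        elif w.pause_before_s >= minor_pause_s:
            indent += 1
            lines.append((indent, [w.text]))
        else:
            lines[-1][1].append(w.text)
    return [(level, " ".join(tokens)) for level, tokens in lines]


def render_cascade(lines: List[Tuple[int, str]], indent_width: int = 4) -> str:
    """Emit text where line breaks give vertical displacement and
    leading spaces give horizontal displacement."""
    return "\n".join(" " * (indent_width * level) + text for level, text in lines)


# Made-up measurements for illustration only.
sample = [
    WordMeasurement("The", 0.00), WordMeasurement("dog", 0.02),
    WordMeasurement("that", 0.20), WordMeasurement("barked", 0.03),
    WordMeasurement("ran", 0.22), WordMeasurement("away", 0.02),
    WordMeasurement("Then", 0.60), WordMeasurement("it", 0.01),
    WordMeasurement("slept", 0.03),
]

if __name__ == "__main__":
    print(render_cascade(build_lines(sample)))
```

With the made-up `sample` data, the sketch places "The dog" at the margin, "that barked" one level in, "ran away" two levels in, and "Then it slept" back at the margin. A real system in the sense of the claim would compute several acoustic properties from the waveform and store the derived relationships in an acoustic language model before producing the layout.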