| CPC G10L 15/1822 (2013.01) [G10L 13/02 (2013.01); G10L 15/02 (2013.01); G10L 15/05 (2013.01); G10L 25/90 (2013.01); G10L 25/93 (2013.01)] | 25 Claims |

|
1. A method for generating a text arrangement from acoustic input, comprising:
obtaining an audio sample that includes multiple speech segments providing words from human speech;
identifying acoustic properties of the words from the audio sample using an acoustic analyzer provided by an audio processing service, wherein the acoustic properties are identified from an acoustic signal of the audio sample, and wherein the acoustic analyzer extracts at least one waveform from the acoustic signal;
computing respective measurements of the acoustic properties for each of the words, using the acoustic analyzer, based on direct measurement of the acoustic signal or comparisons within the acoustic signal captured from the at least one waveform;
generating an acoustic language model in a computer-readable data structure to represent a linguistic relationship of the words, based on the acoustic properties of the words from the audio sample, wherein the linguistic relationship of the words is determined from the respective measurements of the acoustic properties; and
outputting data to arrange the words into a cascade format, based on the linguistic relationship, wherein the cascade format establishes horizontal displacement and vertical displacement among the words.
|
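As an illustration only, the final steps of claim 1 (deriving a linguistic relationship from per-word acoustic measurements and outputting a cascade arrangement) can be sketched as follows. The claim does not specify which acoustic properties are measured or how they map to displacement, so everything named here is an assumption: the `WordMeasure` fields (`pause_before`, `pitch`), the `cascade` function, and the threshold values are hypothetical. The sketch treats a long pause before a word as a line break (vertical displacement) and a drop in pitch relative to the preceding word as one further level of indentation (horizontal displacement).

```python
from dataclasses import dataclass


@dataclass
class WordMeasure:
    """Hypothetical per-word measurements; the claim does not name these fields."""
    text: str
    pause_before: float  # seconds of silence preceding the word (assumed property)
    pitch: float         # mean fundamental frequency in Hz (assumed property)


def cascade(words, pause_threshold=0.25, pitch_drop=0.9, indent="    "):
    """Arrange words into indented lines using assumed acoustic cues.

    A pause of at least `pause_threshold` seconds starts a new line.
    If the word after the pause is lower in pitch than the preceding
    word by the factor `pitch_drop`, the new line is indented one
    level deeper; otherwise indentation resets to level 0.
    """
    lines = []  # each entry: (indent level, list of word strings)
    level = 0
    for i, word in enumerate(words):
        if i == 0:
            lines.append((0, [word.text]))
            continue
        if word.pause_before >= pause_threshold:
            previous = words[i - 1]
            # Comparison within the signal: pitch relative to the prior word.
            if word.pitch < previous.pitch * pitch_drop:
                level += 1
            else:
                level = 0
            lines.append((level, [word.text]))
        else:
            # No significant pause: the word stays on the current line.
            lines[-1][1].append(word.text)
    return [indent * lvl + " ".join(ws) for lvl, ws in lines]


if __name__ == "__main__":
    sample = [
        WordMeasure("The", 0.00, 180.0),
        WordMeasure("dog", 0.05, 175.0),
        WordMeasure("that", 0.30, 150.0),
        WordMeasure("barked", 0.05, 148.0),
        WordMeasure("ran", 0.35, 170.0),
        WordMeasure("away", 0.04, 160.0),
    ]
    print("\n".join(cascade(sample)))
```

In this sketch the list of (indent level, words) pairs stands in for the claimed computer-readable data structure. An actual implementation would obtain the measurements from an acoustic analyzer operating on the waveform rather than from hand-entered values.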