US 11,670,281 B2
Adaptive text-to-speech outputs based on language proficiency
Matthew Sharifi, Kilchberg (CH); and Jakob Nicolaus Foerster, San Francisco, CA (US)
Assigned to Google LLC, Mountain View, CA (US)
Filed by Google LLC, Mountain View, CA (US)
Filed on Jan. 20, 2021, as Appl. No. 17/153,463.
Application 17/153,463 is a continuation of application No. 16/573,492, filed on Sep. 17, 2019, granted, now 10,923,100.
Application 16/573,492 is a continuation of application No. 16/135,885, filed on Sep. 19, 2018, granted, now 10,453,441, issued on Oct. 22, 2019.
Application 16/135,885 is a continuation of application No. 15/653,872, filed on Jul. 19, 2017, granted, now 10,109,270, issued on Oct. 23, 2018.
Application 15/653,872 is a continuation of application No. 15/477,360, filed on Apr. 3, 2017, granted, now 9,886,942, issued on Feb. 6, 2018.
Application 15/477,360 is a continuation of application No. 15/009,432, filed on Jan. 28, 2016, granted, now 9,799,324, issued on Oct. 24, 2017.
Prior Publication US 2021/0142779 A1, May 13, 2021
Int. Cl. G10L 13/00 (2006.01); G06F 40/253 (2020.01); G06F 40/289 (2020.01); G10L 13/08 (2013.01)
CPC G10L 13/00 (2013.01) [G06F 40/253 (2020.01); G06F 40/289 (2020.01); G10L 13/08 (2013.01)] 18 Claims
OG exemplary drawing
 
1. A computer-implemented method when executed on data processing hardware causes the data processing hardware to perform operations comprising:
during a registration process for a client device:
receiving demographic information for a user of the client device; and
designating a language proficiency to the user based on the received demographic information, the language proficiency designated to the user comprising one of a first level of language proficiency or a second level of language proficiency different than the first level of language proficiency;
receiving a voice query input to the client device by the user;
generating audio data comprising a synthesized utterance of a particular text segment responsive to the voice query and based on the language proficiency designated to the user, the particular text segment comprising one of:
a first text segment when the language proficiency designated to the user comprises the first level of language proficiency, the first text segment comprising a respective independent clause conveying primary information responsive to the voice query; or
a second text segment when the language proficiency designated to the user comprises the second level of language proficiency, the second text segment comprising a respective independent clause and one or more subordinate clauses, the one or more subordinate clauses of the second text segment conveying additional information responsive to the voice query that is not included in the first text segment; and
providing the audio data for audible output from the client device.