US 12,462,793 B2
Speech recognition hypothesis generation according to previous occurrences of hypotheses terms and/or contextual data
Ágoston Weisz, Zurich (CH); Alexandru Dovlecel, Zurich (CH); Gleb Skobeltsyn, Kilchberg (CH); Evgeny Cherepanov, Adliswil (CH); Justas Klimavicius, Zurich (CH); Yihui Ma, Zurich (CH); and Lukas Lopatovsky, Kilchberg (CH)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Filed by GOOGLE LLC, Mountain View, CA (US)
Filed on Nov. 29, 2021, as Appl. No. 17/536,938.
Application 17/536,938 is a continuation of application No. 16/614,241, granted, now U.S. Pat. No. 11,189,264, previously published as PCT/US2019/042204, filed on Jul. 17, 2019.
Claims priority of provisional application 62/871,571, filed on Jul. 8, 2019.
Prior Publication US 2022/0084503 A1, Mar. 17, 2022
Int. Cl. G10L 15/07 (2013.01); G10L 15/08 (2006.01); G10L 15/183 (2013.01)
CPC G10L 15/07 (2013.01) [G10L 15/083 (2013.01); G10L 15/183 (2013.01)]
14 Claims
 
1. A method for performing speech recognition on a spoken utterance from a user, the method implemented by one or more processors and comprising:
processing, at a speech recognition engine of a computing device, audio data corresponding to the spoken utterance;
generating, based on processing the audio data and by the speech recognition engine of the computing device, a plurality of current speech recognition hypotheses,
wherein each current speech recognition hypothesis of the plurality of current speech recognition hypotheses includes corresponding terms that are predicted to correspond to original natural language content of the spoken utterance from the user;
identifying term correspondence data that characterizes relationships between previous terms provided in previous speech recognition hypotheses generated based on previous spoken utterances from the user;
determining, based on the term correspondence data, that a given term, of at least a given current hypothesis of the plurality of current speech recognition hypotheses:
is included in the term correspondence data, and
corresponds to a related term of at least a given previous hypothesis of the previous speech recognition hypotheses, in the term correspondence data, that is not included in any of the plurality of current speech recognition hypotheses based at least in part on the given term sharing a common position in the given current hypothesis with the related term in the given previous speech recognition hypothesis;
based on determining that the given term is included in the term correspondence data and corresponds to the related term that is not included in any of the plurality of current speech recognition hypotheses:
generating, via the speech recognition engine of the computing device, a supplemental current speech recognition hypothesis that conforms to the given current hypothesis, but replaces the given term with the related term; and
selecting the supplemental current speech recognition hypothesis as an actual speech recognition result; and
in response to the selecting, causing the computing device to render an output based on the supplemental current speech recognition hypothesis.
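
For illustration only, the steps recited in claim 1 can be sketched in Python. This is a minimal sketch under stated assumptions, not the implementation described in the patent: hypotheses are modeled as whitespace-separated strings, the term correspondence data is modeled as a mapping from a term to (related term, position) pairs, and the function names `build_term_correspondence`, `supplemental_hypotheses`, and `select_result` are hypothetical. The selection policy, which simply prefers a supplemental hypothesis when one exists, is a placeholder, since the claim does not specify how the selection is made.

```python
from collections import defaultdict


def build_term_correspondence(previous_hypothesis_sets):
    """Relate terms that occupied the same position in sibling hypotheses.

    Each element of previous_hypothesis_sets is the list of hypotheses that
    was generated for one previous spoken utterance (assumed data layout).
    Returns a mapping: term -> set of (related_term, position).
    """
    correspondence = defaultdict(set)
    for hypotheses in previous_hypothesis_sets:
        tokenized = [h.split() for h in hypotheses]
        for a in tokenized:
            for b in tokenized:
                # Only compare distinct hypotheses of equal length so that
                # positions line up one to one (a simplifying assumption).
                if a is b or len(a) != len(b):
                    continue
                for position, (term_a, term_b) in enumerate(zip(a, b)):
                    if term_a != term_b:
                        correspondence[term_a].add((term_b, position))
    return correspondence


def supplemental_hypotheses(current_hypotheses, correspondence):
    """Build supplemental hypotheses by swapping in related previous terms."""
    # Every term that already appears in some current hypothesis.
    current_terms = {t for h in current_hypotheses for t in h.split()}
    supplements = []
    for hypothesis in current_hypotheses:
        terms = hypothesis.split()
        for i, term in enumerate(terms):
            # sorted() only makes the iteration order deterministic.
            for related, position in sorted(correspondence.get(term, ())):
                # The related term must be absent from all current hypotheses
                # and must share the given term's position.
                if related in current_terms or position != i:
                    continue
                candidate = " ".join(terms[:i] + [related] + terms[i + 1:])
                if candidate not in supplements:
                    supplements.append(candidate)
    return supplements


def select_result(current_hypotheses, supplements):
    """Placeholder policy: prefer the first supplemental hypothesis, if any."""
    return supplements[0] if supplements else current_hypotheses[0]


if __name__ == "__main__":
    # Hypotheses from one earlier utterance (made-up example data).
    previous = [["call agost", "call august", "call agoston"]]
    correspondence = build_term_correspondence(previous)

    # Hypotheses for the current utterance; "agoston" is missing from them.
    current = ["text august", "text agost"]
    supplements = supplemental_hypotheses(current, correspondence)
    print(select_result(current, supplements))
```

In this made-up example, "august" and "agoston" shared position 1 in the earlier hypotheses, and "agoston" does not occur in any current hypothesis, so the sketch generates "text agoston" as a supplemental hypothesis and selects it.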