US 11,749,267 B2
Adapting hotword recognition based on personalized negatives
Aleksandar Kracun, New York, NY (US); and Matthew Sharifi, Kilchberg (CH)
Assigned to Google LLC, Mountain View, CA (US)
Filed by Google LLC, Mountain View, CA (US)
Filed on Nov. 20, 2020, as Appl. No. 16/953,510.
Prior Publication US 2022/0165277 A1, May 26, 2022
Int. Cl. G10L 15/22 (2006.01); G10L 15/197 (2013.01); G10L 17/06 (2013.01); G10L 17/24 (2013.01); G10L 15/30 (2013.01); G10L 15/08 (2006.01)
CPC G10L 15/22 (2013.01) [G10L 15/197 (2013.01); G10L 15/30 (2013.01); G10L 17/06 (2013.01); G10L 17/24 (2013.01); G10L 2015/088 (2013.01); G10L 2015/223 (2013.01)]
30 Claims
 
1. A method comprising:
receiving, at data processing hardware, audio data characterizing a hotword event detected by a first stage hotword detector in streaming audio captured by a user device;
processing, by the data processing hardware, using a second stage hotword detector, the audio data to determine whether a hotword is detected by the second stage hotword detector in a first segment of the audio data, the second stage hotword detector different from the first stage hotword detector; and
when the hotword is not detected by the second stage hotword detector in the first segment of the audio data:
    classifying, by the data processing hardware, the first segment of the audio data as containing a negative hotword that caused a false detection of the hotword event in the streaming audio by the first stage hotword detector; and
    based on the first segment of the audio data classified as containing the negative hotword, updating, by the data processing hardware, the first stage hotword detector to prevent triggering the hotword event in subsequent audio data that contains the negative hotword, the first stage hotword detector updated based on at least one of:
        a hotword detection probability score determined by the second stage hotword detector for the first segment of the audio data; or
        a negative hotword confidence score determined by the second stage hotword detector, the negative hotword confidence score classifying the first segment of the audio data as the negative hotword.
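
The control flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: every name in it (`SecondStageScores`, `FirstStageDetector`, `verify_hotword_event`, `detect_threshold`), the 0.5 threshold, the weighting rule, and the use of a plain tuple of numbers in place of audio data are assumptions made only for illustration. The claim does not say how the two scores are combined or how the first stage detector is modified, so the sketch simply stores the rejected segment with a weight derived from the scores.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Hypothetical stand-in for a segment of audio data (e.g. a feature vector).
Segment = Tuple[float, ...]


@dataclass
class SecondStageScores:
    hotword_probability: float  # hotword detection probability score
    negative_confidence: float  # negative hotword confidence score


@dataclass
class FirstStageDetector:
    # Toy "update": remember rejected segments as (segment, weight) pairs.
    negatives: List[Tuple[Segment, float]] = field(default_factory=list)

    def update_with_negative(self, segment: Segment, weight: float) -> None:
        self.negatives.append((segment, weight))

    def is_known_negative(self, segment: Segment) -> bool:
        # Exact match only; a real detector would compare acoustically.
        return any(stored == segment for stored, _ in self.negatives)


def verify_hotword_event(
    segment: Segment,
    first_stage: FirstStageDetector,
    second_stage: Callable[[Segment], SecondStageScores],
    detect_threshold: float = 0.5,  # assumed value, not from the claim
) -> bool:
    """Return True if the second stage confirms the hotword event."""
    scores = second_stage(segment)
    if scores.hotword_probability >= detect_threshold:
        return True  # hotword confirmed; the first stage is left unchanged

    # Hotword not detected: classify the segment as a negative hotword and
    # update the first stage using the second stage's scores (assumed rule).
    weight = max(1.0 - scores.hotword_probability, scores.negative_confidence)
    first_stage.update_with_negative(segment, weight)
    return False
```

In this sketch, calling `verify_hotword_event` with a second stage that returns a low hotword probability records the segment in `first_stage.negatives`, after which `is_known_negative` reports it as a stored negative. A confirmed hotword leaves the first stage unchanged.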