CPC G10L 15/06 (2013.01) [G10L 15/16 (2013.01); G10L 15/22 (2013.01); G10L 15/28 (2013.01); G10L 25/90 (2013.01); G10L 2015/088 (2013.01); G10L 2025/783 (2013.01)] | 20 Claims |
1. A computer-implemented method executed on data processing hardware that causes the data processing hardware to perform operations comprising:
receiving a first acoustic segment characterizing a hotword detected by a hotword detector in streaming audio captured by a user device;
without performing speech recognition processing on the streaming audio, extracting one or more hotword attributes from the first acoustic segment;
adjusting, based on the one or more hotword attributes extracted from the first acoustic segment, one or more speech recognition parameters of an automated speech recognition (ASR) model; and
after adjusting the speech recognition parameters of the ASR model, processing, using the ASR model, a second acoustic segment to generate a speech recognition result, the second acoustic segment characterizing a spoken query/command that follows the first acoustic segment in the streaming audio captured by the user device.
|