CPC G10L 15/197 (2013.01) [G10L 15/04 (2013.01); G10L 15/22 (2013.01); G10L 15/30 (2013.01); G10L 2015/088 (2013.01); G10L 2015/223 (2013.01)] | 20 Claims |
1. A computer-implemented method comprising:
generating, by a computer, a plurality of segments of a plurality of audio signals, including one or more candidate segments of a first audio signal and one or more query segments of a second audio signal;
extracting, by the computer, a plurality of features for each segment of the plurality of segments;
determining, by the computer, for a candidate segment of the one or more candidate segments of the first audio signal a plurality of similarity scores corresponding to the one or more query segments of the second audio signal, each similarity score of the plurality of similarity scores indicating a similarity of the features of the candidate segment with respect to the features of a corresponding query segment of the one or more query segments of the second audio signal;
identifying, by the computer, a discovered keyword included in the candidate segment, in response to determining that at least one similarity score of the plurality of similarity scores of the candidate segment satisfies a pairwise matching threshold; and
updating, by the computer, a voice detection model associated with the plurality of audio signals to nullify a probability of a portion of an audio signal of the plurality of audio signals having the discovered keyword.
|