| CPC G10L 17/04 (2013.01) [G10L 17/06 (2013.01); G10L 25/24 (2013.01)] | 18 Claims |

|
9. A computer-implemented method for speaker verification, comprising:
receiving, by a signal processor, an audio signal from an unknown speaker;
extracting, by the signal processor, a first feature from the audio signal, where the first feature is indicative of variability of the audio signal, wherein the first feature is derived by grouping data samples of the audio signal into frames and, for each frame, quantizing each data sample in a given frame in accordance with a difference in magnitude of a data sample with magnitude of a reference data sample in the given frame, thereby creating a pattern of values indicative of the variability of the audio signal;
extracting, by the signal processor, additional features from the audio signal, where the additional features represent the power spectrum of the audio signal;
constructing, by the signal processor, a feature vector by concatenating the first feature with the additional features;
classifying, by a first classifier, the audio signal using the feature vector, where the first classifier is trained to identify recorded audio signals;
classifying, by a second classifier, the audio signal using the feature vector, where the second classifier is trained to identify computer generated audio signals;
classifying, by a third classifier, the audio signal using the feature vector, where the third classifier is trained to identify authentic audio signals; and
labeling the audio signal as one of authentic, recorded or computer generated based on output from the first classifier, the second classifier and the third classifier.
|
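
The steps recited in claim 9 can be illustrated with a minimal sketch. This is not the patented implementation, and the claim does not fix most of the details used here. The following are assumptions made only for illustration: non-overlapping frames, the center sample of each frame as the reference data sample, three-level quantization (-1, 0, 1) with a hypothetical `threshold`, a normalized histogram of the quantized codes as the first feature, a frame-averaged naive DFT power spectrum as the additional features, and a highest-score rule for combining the three classifier outputs. All function names are hypothetical, and the classifiers are passed in as plain scoring functions rather than trained models.

```python
import cmath
from typing import Callable, Dict, List, Sequence

# Hypothetical classifier type: maps a feature vector to a score.
Classifier = Callable[[Sequence[float]], float]


def frames(samples: Sequence[float], size: int) -> List[List[float]]:
    """Group samples into non-overlapping frames, dropping a short tail."""
    return [list(samples[i:i + size])
            for i in range(0, len(samples) - size + 1, size)]


def variability_pattern(frame: Sequence[float], threshold: float) -> List[int]:
    """Quantize each sample by its magnitude difference from a reference sample.

    Assumption: the reference is the center sample of the frame.
    """
    ref_index = len(frame) // 2
    ref = abs(frame[ref_index])
    pattern = []
    for i, x in enumerate(frame):
        if i == ref_index:
            continue  # the reference sample is not compared with itself
        diff = abs(x) - ref
        pattern.append(1 if diff > threshold else -1 if diff < -threshold else 0)
    return pattern


def first_feature(samples: Sequence[float], size: int, threshold: float) -> List[float]:
    """Assumed summary: normalized histogram of the codes -1, 0, 1 over all frames."""
    counts = {-1: 0, 0: 0, 1: 0}
    for frame in frames(samples, size):
        for code in variability_pattern(frame, threshold):
            counts[code] += 1
    total = sum(counts.values()) or 1
    return [counts[c] / total for c in (-1, 0, 1)]


def power_spectrum(frame: Sequence[float]) -> List[float]:
    """Naive DFT power spectrum for bins 0 .. n/2 (fine for short frames)."""
    n = len(frame)
    bins = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * cmath.pi * k * t / n)
                    for t, x in enumerate(frame))
        bins.append(abs(coeff) ** 2 / n)
    return bins


def spectral_features(samples: Sequence[float], size: int) -> List[float]:
    """Average the per-frame power spectra into one vector."""
    spectra = [power_spectrum(f) for f in frames(samples, size)]
    if not spectra:
        return [0.0] * (size // 2 + 1)
    return [sum(col) / len(spectra) for col in zip(*spectra)]


def feature_vector(samples: Sequence[float], size: int, threshold: float) -> List[float]:
    """Concatenate the first feature with the additional (spectral) features."""
    return first_feature(samples, size, threshold) + spectral_features(samples, size)


def label_audio(vector: Sequence[float], classifiers: Dict[str, Classifier]) -> str:
    """Assumed decision rule: return the label whose classifier scores highest."""
    scores = {name: clf(vector) for name, clf in classifiers.items()}
    return max(scores, key=scores.get)


# Usage with stand-in scoring functions (placeholders, not trained models).
samples = [0.0, 1.0, 0.0, -1.0] * 4
vector = feature_vector(samples, size=4, threshold=0.5)
stub_classifiers = {
    "recorded": lambda v: v[0],
    "computer generated": lambda v: v[1],
    "authentic": lambda v: v[2],
}
label = label_audio(vector, stub_classifiers)
```

In practice each entry in `stub_classifiers` would be replaced by a model trained on recorded, computer generated, and authentic audio respectively, and the combination rule could differ from the simple maximum used here.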