| CPC G10L 25/51 (2013.01) [G11B 27/28 (2013.01)] | 20 Claims |

|
1. A non-transitory computer readable medium comprising instructions which, when executed, cause one or more processors to:
divide an audio signal into a plurality of audio segments, wherein the plurality of audio segments comprises a first audio segment, a second audio segment, and a third audio segment;
normalize the second audio segment to create a first normalized audio segment based on first audio characteristics of the first audio segment and second audio characteristics of the second audio segment and a second normalized audio segment based on third audio characteristics of the third audio segment and at least one of the first audio characteristics of the first audio segment and the second audio characteristics of the second audio segment;
generate a first subfingerprint from the first normalized audio segment, wherein the first subfingerprint comprises a first portion corresponding to a location of an energy extremum in the normalized second audio segment, and a second subfingerprint from the second normalized audio segment;
determine a likelihood of the first portion to change based on changes to at least one of the first audio characteristics, the second audio characteristics, and the third audio characteristics;
in response to determining the likelihood does not satisfy a threshold, replace the first portion with a second portion; and
determine if the second subfingerprint includes the first portion.
|