CPC G06F 16/686 (2019.01) [G06F 16/61 (2019.01); G06F 17/14 (2013.01); G10L 25/27 (2013.01); G10L 25/51 (2013.01)] | 20 Claims |
1. A non-transitory computer-readable medium comprising instructions that, when executed, cause one or more processors to perform a set of operations comprising:
binarizing one or more constant Q transformed time slices of query audio;
generating two-dimensional Fourier transforms of time windows within the binarized one or more constant Q transformed time slices;
ordering the two-dimensional Fourier transforms in a query data structure; and
identifying the query audio as a cover rendition of reference audio based on a comparison between the query data structure and a reference data structure associated with the reference audio.
|
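The operations recited in claim 1 can be illustrated with a minimal sketch. It is an illustration under stated assumptions, not the claimed implementation. It assumes the constant Q transform has already been computed and is passed in as a magnitude matrix (frequency bins by time frames). The per-slice median threshold, the window width and hop, the time-ordered stacking, and the nearest-window distance score are all assumptions. The names `binarize`, `windowed_fft2`, `build_structure`, `cover_distance`, and `is_cover` are hypothetical and do not come from the claim.

```python
import numpy as np


def binarize(cqt_mag):
    """Binarize each time slice (column) of a constant Q magnitude matrix.

    Assumption: a bin is set to 1 when it exceeds the median of its own slice.
    """
    thresh = np.median(cqt_mag, axis=0, keepdims=True)
    return (cqt_mag > thresh).astype(np.float64)


def windowed_fft2(binary, width=20, hop=10):
    """Take the 2D Fourier transform magnitude of sliding time windows.

    Rows of the result follow time order, which serves here as the
    "ordering" of the transforms (an assumption about the data structure).
    """
    n_bins, n_frames = binary.shape
    if n_frames < width:
        raise ValueError("need at least `width` time frames")
    patches = []
    for start in range(0, n_frames - width + 1, hop):
        window = binary[:, start:start + width]
        # The magnitude discards phase, so circular shifts along either
        # axis (for example a transposition in log-frequency) cancel out.
        patches.append(np.abs(np.fft.fft2(window)).ravel())
    return np.stack(patches)


def build_structure(cqt_mag, width=20, hop=10):
    """Binarize, window, and transform: one row per time window."""
    return windowed_fft2(binarize(cqt_mag), width, hop)


def cover_distance(query, reference):
    """Mean distance from each query window to its nearest reference window."""
    diffs = query[:, None, :] - reference[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=2))
    return float(dists.min(axis=1).mean())


def is_cover(query, reference, threshold):
    """Hypothetical decision rule: a small distance suggests a cover."""
    return cover_distance(query, reference) <= threshold
```

In this sketch, a query whose constant Q matrix is a circular shift of the reference along the frequency axis gives a distance of essentially zero, because the 2D Fourier magnitude is unchanged by circular shifts. A real transposition is only approximately circular, so a practical threshold would need to be tuned on labeled data.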