CPC G06F 16/683 (2019.01) [G06Q 50/184 (2013.01)] | 21 Claims |
15. A method comprising:
executing a constant Q transform on time slices of query audio data to output constant Q transformed time slices;
binarizing the constant Q transformed time slices to output binarized and constant transformed time slices;
executing a two-dimensional Fourier transform on time windows within the binarized and constant Q transformed time slices to output two-dimensional Fourier transforms of the time windows;
generating a reference data structure based on a sequential order of the two-dimensional Fourier transforms;
storing the reference data structure in a database; and
identifying a query data structure associated with query audio data as a cover rendition of the audio data based on a comparison of the query data structure and the reference data structure using a similarity matrix, wherein the similarity matrix indicates at least one degree to which reference portions of the reference data structure are associated with query portions of the query data structure, and wherein the at least one degree satisfies a corresponding threshold.
|