US 12,094,490 B2
Cover song identification method and system
Xiaochen Liu, Jersey City, NJ (US); Joseph P. Renner, Lille (FR); Joshua E. Morris, Nashville, TN (US); Todd J. Hodges, Oakland, CA (US); Robert Coover, Orinda, CA (US); and Zafar Rafii, Berkeley, CA (US)
Assigned to Gracenote, Inc., New York, NY (US)
Filed by Gracenote, Inc., Emeryville, CA (US)
Filed on Jun. 1, 2021, as Appl. No. 17/335,657.
Claims priority of provisional application 63/133,042, filed on Dec. 31, 2020.
Prior Publication US 2022/0208217 A1, Jun. 30, 2022
Int. Cl. G10L 25/90 (2013.01); G06F 16/632 (2019.01); G06F 16/683 (2019.01); G06N 3/08 (2023.01); G06N 20/00 (2019.01); G10L 19/022 (2013.01)
CPC G10L 25/90 (2013.01) [G06F 16/634 (2019.01); G06F 16/685 (2019.01); G06N 3/08 (2013.01); G06N 20/00 (2019.01); G10L 19/022 (2013.01)] 16 Claims
OG exemplary drawing
 
1. A computer-implemented cover song identification method comprising:
receiving, by a computing system and from a user device, harmonic pitch class profile (HPCP) information that specifies one or more HPCP features associated with target audio content;
deriving, from the one or more HPCP features, a major chord profile feature and a minor chord profile feature associated with the target audio content, wherein deriving the major chord profile feature and the minor chord profile feature associated with the target audio content comprises selecting HPCP features that correlate with twelve different major chords and twelve different minor chords, time aligning the selected HPCP features to a nearest bar line or measure based on an estimated tempo and beat associated with the target audio content, and normalizing the time aligned selected HPCP features, to emphasize a sequential structure of the target audio content with major and minor chords that are present in the target audio content;
determining, by machine learning logic of the computing system and based on the major chord profile feature and the minor chord profile feature, a relatedness between the target audio content and each of a plurality of audio content items specified in records of a database, wherein each audio content item is associated with cover song information; and
communicating, to the user device, cover song information associated with an audio content item having a highest relatedness to the target audio content.