US 11,954,149 B2
Multiple stage indexing of audio content
Peter C. DiMaria, Berkeley, CA (US); Markus K. Cremer, Orinda, CA (US); Barnabas Mink, San Francisco, CA (US); Tanji Koshio, Emeryville, CA (US); and Kei Tsuji, Emeryville, CA (US)
Assigned to Gracenote, Inc., Emeryville, CA (US)
Filed by Gracenote, Inc., Emeryville, CA (US)
Filed on Oct. 27, 2022, as Appl. No. 18/050,326.
Application 18/050,326 is a continuation of application No. 17/140,992, filed on Jan. 4, 2021, granted, now 11,487,814.
Application 17/140,992 is a continuation of application No. 15/475,459, filed on Mar. 31, 2017, granted, now 10,885,109, issued on Jan. 5, 2021.
Prior Publication US 2023/0071565 A1, Mar. 9, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/61 (2019.01); G06F 16/683 (2019.01)
CPC G06F 16/683 (2019.01) [G06F 16/61 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method comprising:
determining, by at least one hardware processor, a representative audio content for a cluster, wherein the cluster comprises at least two audio contents;
loading, by the at least one hardware processor, the representative audio content into an index, wherein the representative audio content is stored in association with a hash value, and wherein the hash value is associated with a candidate reference identifier;
removing, by the at least one hardware processor, candidate reference identifiers that appear less than a threshold number of times;
in response to removing the candidate reference identifiers that appear less than a threshold number of times, generating a first comparison, by the at least one hardware processor, of a query audio content to each representative audio content associated with a remaining set of candidate reference identifiers, wherein the remaining set of candidate reference identifiers does not include the removed candidate reference identifiers, and wherein the first comparison is generated using a first matching criteria; and
matching, by the at least one hardware processor, the query audio content to one of the representative audio content based on the generated first comparison.