US 11,874,814 B2
Methods and apparatus for efficient media indexing
Matthew James Wilkinson, Emeryville, CA (US); Jeffrey Scott, Berkeley, CA (US); Robert Coover, Orinda, CA (US); and Konstantinos Antonios Dimitriou, San Francisco, CA (US)
Assigned to Gracenote, Inc., Emeryville, CA (US)
Filed by Gracenote, Inc., Emeryville, CA (US)
Filed on Mar. 7, 2022, as Appl. No. 17/688,632.
Application 17/688,632 is a continuation of application No. 16/561,908, filed on Sep. 5, 2019, granted, now 11,269,840.
Claims priority of provisional application 62/727,908, filed on Sep. 6, 2018.
Claims priority of application No. 20180100409 (GR), filed on Sep. 6, 2018.
Prior Publication US 2022/0197882 A1, Jun. 23, 2022
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/22 (2019.01); G06F 16/41 (2019.01); G06F 7/58 (2006.01)
CPC G06F 16/2255 (2019.01) [G06F 16/41 (2019.01); G06F 7/588 (2013.01)]
20 Claims
1. An apparatus, comprising:
means for initiating a list of hash seeds, the list of hash seeds including at least a first hash seed value and a second hash seed value among other hash seed values;
means for generating a first bucket distribution and a second bucket distribution comprising:
generating the first bucket distribution based on the first hash seed value and a first hash function; and
generating the second bucket distribution based on the second hash seed value used in combination with the first hash seed value;
means for determining a first entropy value and a second entropy value comprising:
determining the first entropy value of the first bucket distribution, wherein data associated with the first bucket distribution is stored in a first hash table; and
determining the second entropy value of the second bucket distribution;
means for selecting the second hash seed value to store in a second hash table when the second entropy value is greater than a plurality of other entropy values associated with other bucket distributions, the other bucket distributions generated using the other hash seed values, the first hash seed value or the second hash seed value used to identify a location of one or more subfingerprints associated with audio fingerprint data, the first entropy value or the second entropy value associated with a distribution of the one or more subfingerprints; and
means for retrieving the audio fingerprint data using the first hash table or the second hash table to reduce fingerprint search times.
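
The seed-selection idea recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the function names (`bucket_of`, `bucket_entropy`, `select_seed`, `build_table`), the use of BLAKE2b as the seeded hash function, the 32-bit subfingerprint width, and the use of Shannon entropy in bits are all assumptions made for illustration. The sketch also evaluates each candidate seed independently, whereas the claim recites generating the second bucket distribution from the second seed in combination with the first.

```python
import hashlib
import math
from collections import Counter, defaultdict


def bucket_of(subfingerprint: int, seed: int, num_buckets: int) -> int:
    """Map a 32-bit subfingerprint to a bucket using a seeded hash (assumed: BLAKE2b)."""
    data = subfingerprint.to_bytes(4, "big")
    salt = seed.to_bytes(8, "big")
    digest = hashlib.blake2b(data, digest_size=8, salt=salt).digest()
    return int.from_bytes(digest, "big") % num_buckets


def bucket_entropy(subfingerprints, seed: int, num_buckets: int) -> float:
    """Shannon entropy (bits) of the bucket distribution produced by one seed."""
    counts = Counter(bucket_of(s, seed, num_buckets) for s in subfingerprints)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def select_seed(subfingerprints, seeds, num_buckets: int) -> int:
    """Pick the candidate seed whose bucket distribution has the highest entropy."""
    return max(seeds, key=lambda s: bucket_entropy(subfingerprints, s, num_buckets))


def build_table(subfingerprints, seed: int, num_buckets: int):
    """Hash table from bucket index to positions of the subfingerprints stored there."""
    table = defaultdict(list)
    for position, s in enumerate(subfingerprints):
        table[bucket_of(s, seed, num_buckets)].append(position)
    return table


if __name__ == "__main__":
    # Hypothetical subfingerprints; real ones would come from audio fingerprint data.
    subs = [(i * 2654435761) % 2**32 for i in range(1000)]
    best = select_seed(subs, seeds=[11, 23, 42, 97], num_buckets=64)
    table = build_table(subs, best, 64)
    # Lookup touches only one bucket instead of scanning every subfingerprint.
    candidates = table[bucket_of(subs[10], best, 64)]
    print(best, len(candidates))
```

A higher entropy value means the subfingerprints are spread more evenly across buckets, so a lookup inspects fewer candidates on average, which is the sense in which the claim ties entropy to reduced fingerprint search times.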