US 12,334,094 B2
Audio cancellation for voice recognition
Richard Mitic, Stockholm (SE); Robert Swain, Stockholm (SE); Daniel Bromand, Stockholm (SE); Waqar Sheikh, Malmo (SE); and James Robert Stansfield, Flyinge (SE)
Assigned to Spotify AB, Stockholm (SE)
Filed by Spotify AB, Stockholm (SE)
Filed on Nov. 17, 2022, as Appl. No. 18/056,611.
Application 18/056,611 is a continuation of application No. 17/158,312, filed on Jan. 26, 2021, granted, now 11,605,393.
Application 17/158,312 is a continuation of application No. 16/593,539, filed on Oct. 4, 2019, granted, now 10,943,599, issued on Mar. 9, 2021.
Claims priority of provisional application 62/820,762, filed on Mar. 19, 2019.
Claims priority of application No. 18202941 (EP), filed on Oct. 26, 2018.
Prior Publication US 2023/0162752 A1, May 25, 2023
Int. Cl. G10L 21/0232 (2013.01); G10L 15/20 (2006.01); G10L 15/22 (2006.01); G10L 25/51 (2013.01); H04R 3/00 (2006.01)
CPC G10L 21/0232 (2013.01) [G10L 25/51 (2013.01); H04R 3/00 (2013.01); G10L 15/20 (2013.01); G10L 15/22 (2013.01); G10L 2015/223 (2013.01); H04R 2420/07 (2013.01)]
20 Claims
1. A media delivery system comprising:
a processor;
a memory storing data instructions that, when executed by the processor, cause the media delivery system to:
receive, from a sound system, a time delay between a first time associated with a generation of an audio cue at a media playback device of the sound system and a second time associated with when the audio cue is recorded in a recording in an audio buffer of the media playback device, wherein the audio cue represents a first signal sent at the first time and a second signal sent at a third time, wherein the first time and the third time are different, wherein a calibration value is used by the sound system in an audio cancellation operation to reduce background noise in the recording, wherein the calibration value is based on the time delay, and wherein the media playback device determines the time delay by:
determining a first time delay associated with the first signal;
determining a second time delay associated with the second signal; and
averaging the first time delay and the second time delay associated with the first and second signals to determine the time delay;
analyze, at the media delivery system, performance of the audio cancellation operation when operated using the calibration value; and
send instructions to the sound system to adjust the calibration value.
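
The delay determination recited in claim 1 (a first time delay for the first signal, a second time delay for the second signal, and an average of the two) might be sketched as follows. This is only an illustrative sketch, not the patented implementation: the brute-force correlation search, the function names `detect_offset` and `averaged_time_delay`, the `max_lag` search window, and the sample values are all assumptions introduced here, and the claim does not specify how each individual delay is measured.

```python
from statistics import mean


def detect_offset(cue, recording, start, max_lag):
    """Return the lag (in samples) after `start` at which `cue` best matches
    `recording`, using a brute-force correlation search (an assumed method)."""
    best_lag, best_score = 0, float("-inf")
    last = min(max_lag, len(recording) - len(cue) - start)
    for lag in range(last + 1):
        score = sum(c * recording[start + lag + i] for i, c in enumerate(cue))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def averaged_time_delay(cue, recording, emit_samples, sample_rate, max_lag):
    """Measure one delay per emitted signal, then average them (seconds)."""
    delays = [
        detect_offset(cue, recording, start, max_lag) / sample_rate
        for start in emit_samples
    ]
    return mean(delays)


# Hypothetical data: the same short cue is emitted at samples 10 and 50 and
# shows up in the recorded buffer 4 and 6 samples later, respectively.
SAMPLE_RATE = 1000  # assumed sample rate in Hz
cue = [1.0, -1.0, 1.0]
recording = [0.0] * 100
for arrival in (14, 56):
    for i, value in enumerate(cue):
        recording[arrival + i] = value

time_delay = averaged_time_delay(cue, recording, [10, 50], SAMPLE_RATE, max_lag=20)
```

Here the two per-signal delays are 4 ms and 6 ms, so the averaged time delay is about 5 ms. In the claim, a calibration value based on this delay is then used by the sound system in the audio cancellation operation; how that value is derived from the delay is not shown in this sketch.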