US 12,452,620 B2
Multi-channel speech compression system and method
Dushyant Sharma, Mountain House, CA (US); Patrick A. Naylor, Reading (GB); and Uwe Helmut Jost, Groton, MA (US)
Assigned to Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on May 28, 2024, as Appl. No. 18/676,347.
Application 18/676,347 is a continuation of application No. 17/669,592, filed on Feb. 11, 2022, granted, now 11,997,469.
Claims priority of provisional application 63/183,848, filed on May 4, 2021.
Claims priority of provisional application 63/148,427, filed on Feb. 11, 2021.
Prior Publication US 2024/0323630 A1, Sep. 26, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. H04S 7/00 (2006.01); G06T 7/70 (2017.01); G10L 15/06 (2013.01); G10L 15/22 (2006.01); G10L 19/00 (2013.01); G10L 19/008 (2013.01); G10L 19/16 (2013.01); G10L 21/0208 (2013.01); G10L 21/0216 (2013.01); H04R 1/40 (2006.01); H04R 3/00 (2006.01); H04R 5/027 (2006.01); H04S 3/00 (2006.01)
CPC H04S 7/30 (2013.01) [G06T 7/70 (2017.01); G10L 15/063 (2013.01); G10L 15/22 (2013.01); G10L 19/008 (2013.01); G10L 19/167 (2013.01); G10L 21/0208 (2013.01); H04R 1/406 (2013.01); H04R 3/005 (2013.01); H04R 5/027 (2013.01); H04S 3/008 (2013.01); G10L 2019/0001 (2013.01); G10L 2019/0002 (2013.01); G10L 2021/02166 (2013.01); H04R 2201/401 (2013.01); H04S 2400/01 (2013.01); H04S 2400/15 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method, executed on a computing device, comprising:
encoding audio encounter information of a reference audio acquisition device of a plurality of audio acquisition devices of an audio recording system, thus defining encoded reference audio encounter information;
estimating, via a machine vision system, location information for an acoustic source within an acoustic environment;
accessing an acoustic relative transfer function codebook for the plurality of audio acquisition devices of the audio recording system, wherein the acoustic relative transfer function codebook includes a plurality of acoustic relative transfer functions between the reference audio acquisition device and the plurality of audio acquisition devices of the audio recording system, wherein each of the plurality of acoustic relative transfer functions is associated with a unique identifier for being uniquely identifiable from the plurality of acoustic relative transfer functions of the acoustic relative transfer function codebook based upon, at least in part, the location information and speakers within the acoustic environment, and wherein the acoustic relative transfer function codebook for the plurality of audio acquisition devices of the audio recording system includes a data structure configured to store mapping characteristics for mapping speech signals obtained by the reference audio acquisition device to speech signals of another audio acquisition device of the plurality of audio acquisition devices;
selecting one or more acoustic relative transfer functions from the acoustic relative transfer function codebook based upon, at least in part, the location information; and
transmitting the encoded reference audio encounter information and a representation of the selected one or more acoustic relative transfer functions.