| CPC H04S 7/30 (2013.01) [G06T 7/70 (2017.01); G10L 15/063 (2013.01); G10L 15/22 (2013.01); G10L 19/008 (2013.01); G10L 19/167 (2013.01); G10L 21/0208 (2013.01); H04R 1/406 (2013.01); H04R 3/005 (2013.01); H04R 5/027 (2013.01); H04S 3/008 (2013.01); G10L 2019/0001 (2013.01); G10L 2019/0002 (2013.01); G10L 2021/02166 (2013.01); H04R 2201/401 (2013.01); H04S 2400/01 (2013.01); H04S 2400/15 (2013.01)] | 20 Claims |

|
1. A computer-implemented method, executed on a computing device, comprising:
encoding audio encounter information of a reference audio acquisition device of a plurality of audio acquisition devices of an audio recording system, thus defining encoded reference audio encounter information;
estimating, via a machine vision system, location information for an acoustic source within an acoustic environment;
accessing an acoustic relative transfer function codebook for the plurality of audio acquisition devices of the audio recording system, wherein the acoustic relative transfer function codebook includes a plurality of acoustic relative transfer functions between the reference audio acquisition device and the plurality of audio acquisition devices of the audio recording system, wherein each of the plurality of acoustic relative transfer functions is associated with a unique identifier for being uniquely identifiable from the plurality of acoustic relative transfer functions of the acoustic relative transfer function codebook based upon, at least in part, the location information and speakers within the acoustic environment, and wherein the acoustic relative transfer function codebook for the plurality of audio acquisition devices of the audio recording system includes a data structure configured to store mapping characteristics for mapping speech signals obtained by the reference audio acquisition device to speech signals of another audio acquisition device of the plurality of audio acquisition devices;
selecting one or more acoustic relative transfer functions from the acoustic relative transfer function codebook based upon, at least in part, the location information; and
transmitting the encoded reference audio encounter information and a representation of the selected one or more acoustic relative transfer functions.
|