US 11,962,991 B2
Non-coincident audio-visual capture system
Edward Stein, Soquel, CA (US); and Martin Walsh, Scotts Valley, CA (US)
Assigned to DTS, Inc., Calabasas, CA (US)
Appl. No. 17/625,407
Filed by DTS, Inc., Calabasas, CA (US)
PCT Filed Jul. 8, 2019, PCT No. PCT/US2019/040837
§ 371(c)(1), (2) Date Jan. 7, 2022,
PCT Pub. No. WO2021/006871, PCT Pub. Date Jan. 14, 2021.
Prior Publication US 2022/0272477 A1, Aug. 25, 2022
Int. Cl. H04S 7/00 (2006.01); H04R 3/00 (2006.01); H04R 5/027 (2006.01); H04S 3/00 (2006.01); H04S 3/02 (2006.01)
CPC H04S 7/30 (2013.01) [H04R 3/005 (2013.01); H04R 5/027 (2013.01); H04S 3/008 (2013.01); H04S 3/02 (2013.01); H04S 2400/01 (2013.01); H04S 2400/11 (2013.01); H04S 2400/15 (2013.01); H04S 2420/11 (2013.01)] 19 Claims
OG exemplary drawing
 
1. A method for updating a frame of reference for a spatial audio signal, the method comprising:
receiving a first spatial audio signal from an audio capture source, the audio capture source having a first frame of reference relative to an environment, and the first spatial audio signal including multiple signal components representing audio information from different depths or directions relative to a location of the audio capture source in the environment;
receiving information about a second frame of reference relative to the same environment, the second frame of reference corresponding to an image capture sensor;
determining a difference between the first and second frames of reference;
decomposing the first spatial audio signal into respective audio signal components, each audio signal component having a corresponding position in the environment;
selecting, based on the determined difference between the first and second frames of reference, respective filters for processing the audio signal components of the first spatial audio signal;
applying the selected filters to the respective audio signal components of the first spatial audio signal to generate respective spatially transformed components; and
using the spatially transformed components, generating a second spatial audio signal referenced to the second frame of reference.