US 12,003,946 B2
Adaptable spatial audio playback
Alan J. Seefeldt, Alameda, CA (US); Joshua B. Lando, Mill Valley, CA (US); Daniel Arteaga, Barcelona (ES); Glenn N. Dickins, Como (AU); and Mark Richard Paul Thomas, Walnut Creek, CA (US)
Assigned to DOLBY LABORATORIES LICENSING CORPORATION, San Francisco, CA (US); and DOLBY INTERNATIONAL AB, Dublin (IE)
Appl. No. 17/630,098
Filed by DOLBY LABORATORIES LICENSING CORPORATION, San Francisco, CA (US); and DOLBY INTERNATIONAL AB, Amsterdam Zuidoost (NL)
PCT Filed Jul. 16, 2020, PCT No. PCT/US2020/042391
§ 371(c)(1), (2) Date Jan. 25, 2022,
PCT Pub. No. WO2021/021460, PCT Pub. Date Feb. 4, 2021.
Claims priority of provisional application 62/705,410, filed on Jun. 25, 2020.
Claims priority of provisional application 62/705,351, filed on Jun. 23, 2020.
Claims priority of provisional application 62/992,068, filed on Mar. 19, 2020.
Claims priority of provisional application 62/971,421, filed on Feb. 7, 2020.
Claims priority of provisional application 62/949,998, filed on Dec. 18, 2019.
Claims priority of provisional application 62/880,114, filed on Jul. 30, 2019.
Claims priority of application No. ES201930702 (ES), filed on Jul. 30, 2019; and application No. 19217580 (EP), filed on Dec. 18, 2019.
Prior Publication US 2022/0337969 A1, Oct. 20, 2022
Int. Cl. H04S 7/00 (2006.01); H04R 5/02 (2006.01)
CPC H04S 7/302 (2013.01) [H04R 5/02 (2013.01)] 29 Claims
OG exemplary drawing
 
1. An audio processing system, comprising:
an interface system; and
a control system configured for:
receiving audio data via the interface system, the audio data including one or more audio signals and associated spatial data, the spatial data indicating an intended perceived spatial position corresponding to an audio signal, the spatial data including at least one of channel data or spatial metadata;
receiving, via the interface system, a rendering mode indication, wherein receiving the rendering mode indication involves receiving an indication of a number of people in a listening area;
determining a rendering mode based, at least in part, on the number of people in the listening area;
rendering the audio data for reproduction via a set of loudspeakers of an environment according to the rendering mode, to produce rendered audio signals, wherein:
rendering the audio data comprises determining relative activation of the set of loudspeakers in an environment;
the rendering mode is variable between a reference spatial mode and one or more distributed spatial modes;
the reference spatial mode has an assumed listening position and orientation; and
in the one or more distributed spatial modes, one or more elements of the audio data is or are each rendered in a more spatially distributed manner than in the reference spatial mode and spatial locations of remaining elements of the audio data are warped such that they span a rendering space of the environment more completely than in the reference spatial mode; and
providing, via the interface system, the rendered audio signals to at least some loudspeakers of the set of loudspeakers of the environment.