US 12,445,791 B2
Spatial audio rendering adaptive to signal level and loudspeaker playback limit thresholds
Alan J. Seefeldt, Alameda, CA (US); Joshua B. Lando, Mill Valley, CA (US); and Timothy Alan Port, Drummoyne (AU)
Assigned to Dolby Laboratories Licensing Corporation, San Francisco, CA (US)
Appl. No. 18/998,247
Filed by Dolby Laboratories Licensing Corporation, San Francisco, CA (US)
PCT Filed Jul. 21, 2023, PCT No. PCT/US2023/028378
§ 371(c)(1), (2) Date Jan. 24, 2025,
PCT Pub. No. WO2024/025803, PCT Pub. Date Feb. 1, 2024.
Claims priority of provisional application 63/505,652, filed on Jun. 1, 2023.
Claims priority of provisional application 63/413,923, filed on Oct. 6, 2022.
Claims priority of provisional application 63/392,794, filed on Jul. 27, 2022.
Prior Publication US 2025/0267417 A1, Aug. 21, 2025
Int. Cl. H04S 7/00 (2006.01); H04R 5/02 (2006.01)
CPC H04S 7/30 (2013.01) [H04R 5/02 (2013.01)]
24 Claims
 
1. An audio processing method, comprising:
receiving, by a control system and via an interface system, audio data, the audio data including one or more audio signals and associated spatial data, the spatial data indicating an intended perceived spatial position corresponding to an audio signal;
rendering, by the control system, the audio data for reproduction via a set of two or more loudspeakers of an environment, to produce loudspeaker signals, wherein:
rendering each of the one or more audio signals included in the audio data involves a mapping for each audio signal to the loudspeaker signals, the mapping being a time- and frequency-varying mapping;
the mapping for each audio signal is computed as a function of an audio signal's intended perceived spatial position, physical positions associated with the loudspeakers and a time- and frequency-varying representation of loudspeaker signal level relative to a maximum playback limit of each loudspeaker;
each mapping is computed to approximately achieve the intended perceived spatial position of an associated audio signal when the loudspeaker signals are played back over the set of loudspeakers located at associated loudspeaker positions;
a representation of loudspeaker signal level relative to a maximum playback limit is computed for each audio signal as a function of one or more of the audio signals and their perceived spatial positions; and
the mapping of an audio signal into a particular loudspeaker signal is reduced as the representation of loudspeaker signal level relative to a maximum playback limit increases above a threshold, while the mapping is increased into one or more other loudspeakers for which the representations of signal level relative to the maximum playback limits of one or more other loudspeakers are less than a threshold; and
providing, via the interface system, the loudspeaker signals to at least two loudspeakers of the set of loudspeakers of the environment.
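The gain-redistribution element of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. It handles one audio signal in a single time-frequency tile and assumes the starting gains come from some position-based panner that is not shown. The function name `redistribute_gains`, its parameters, the hard cap at the threshold, and the power-preserving redistribution in proportion to headroom are all assumptions made for illustration; the claim does not specify them.

```python
import math


def redistribute_gains(base_gains, signal_level, playback_limits, threshold=1.0):
    """Hypothetical helper: adjust one signal's per-loudspeaker gains for one
    time-frequency tile so that no loudspeaker exceeds `threshold` times its
    playback limit, moving the removed power to loudspeakers with headroom."""
    if signal_level <= 0.0:
        return list(base_gains)

    # Level each loudspeaker would see, relative to its maximum playback limit.
    ratios = [g * signal_level / lim
              for g, lim in zip(base_gains, playback_limits)]

    gains = list(base_gains)
    removed_power = 0.0

    # Reduce the mapping into loudspeakers whose relative level is above threshold.
    for i, ratio in enumerate(ratios):
        if ratio > threshold:
            capped = threshold * playback_limits[i] / signal_level
            removed_power += gains[i] ** 2 - capped ** 2
            gains[i] = capped

    # Power headroom of loudspeakers whose relative level is below threshold.
    headroom = []
    for i, ratio in enumerate(ratios):
        if ratio < threshold:
            max_gain = threshold * playback_limits[i] / signal_level
            headroom.append(max_gain ** 2 - gains[i] ** 2)
        else:
            headroom.append(0.0)

    # Increase the mapping into those loudspeakers (assumed rule: share the
    # removed power in proportion to headroom, never exceeding the headroom).
    total_headroom = sum(headroom)
    if removed_power > 0.0 and total_headroom > 0.0:
        share = min(1.0, removed_power / total_headroom)
        for i, h in enumerate(headroom):
            gains[i] = math.sqrt(gains[i] ** 2 + share * h)

    return gains


# Illustrative values only: loudspeaker 0 has a low playback limit.
example = redistribute_gains([0.9, 0.3, 0.3], 1.0, [0.5, 1.0, 1.0])
```

In this example the gain into the first loudspeaker drops to its cap while the gains into the other two rise, and total power is unchanged. A fuller renderer would repeat this per signal and per time-frequency tile, and would weigh the redistribution against the intended perceived spatial position, which this sketch ignores.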