US 12,382,238 B2
XR rendering for 3D audio content and audio codec
Ferdinando Olivieri, Carlsbad, CA (US); and Taher Shahbazi Mirzahasanloo, San Diego, CA (US)
Assigned to QUALCOMM Incorporated, San Diego, CA (US)
Filed by QUALCOMM Incorporated, San Diego, CA (US)
Filed on Jul. 30, 2021, as Appl. No. 17/444,138.
Prior Publication US 2023/0051841 A1, Feb. 16, 2023
Int. Cl. H04S 7/00 (2006.01); G02B 27/00 (2006.01); G02B 27/01 (2006.01); G06F 3/01 (2006.01); G10L 19/008 (2013.01); H04R 5/04 (2006.01)
CPC H04S 7/303 (2013.01) [G02B 27/0093 (2013.01); G02B 27/017 (2013.01); G06F 3/013 (2013.01); G10L 19/008 (2013.01); H04R 5/04 (2013.01); G02B 2027/0147 (2013.01); H04S 2400/01 (2013.01); H04S 2400/11 (2013.01); H04S 2420/01 (2013.01)] 30 Claims
OG exemplary drawing
 
25. A method comprising:
receiving, at one or more processors included in a playback device, an audio bitstream including encoded audio data and metadata sent from another device, the encoded audio data corresponding to one or more sound sources, and the metadata indicative of one or more directions of the one or more sound sources;
decoding, at the one or more processors, the encoded audio data of the audio bitstream to generate decoded audio data;
obtaining, at the one or more processors, direction data at the playback device, indicating a viewing direction associated with a user of the playback device;
obtaining motion sensor data, at the one or more processors, from one or more sensors coupled to the playback device;
adjusting, to one or more adjusted directions at the one or more processors, the one or more directions of the one or more sound sources indicated in the metadata to compensate for the motion sensor data;
determining, at the one or more processors, a resolution setting that indicates a number of de-quantization bits used for rendering the decoded audio data for a first sound source of the one or more sound sources, the resolution sitting for the first sound source based on a similarity between the viewing direction and a direction of the one or more adjusted directions corresponding to the first sound source; and
rendering, at the one or more processors, the decoded audio data for the first sound source based on the motion sensor data and the resolution setting for the first sound source to generate rendered audio data for the first sound source.