US 12,380,904 B2
Seamless scalable decoding of channels, objects, and HOA audio content
Moo Young Kim, San Diego, CA (US); Dipanjan Sen, Dublin, CA (US); Eric Allamanche, Sunnyvale, CA (US); J. Kevin Calhoun, Santa Rosa, CA (US); Frank Baumgarte, Sunnyvale, CA (US); Sina Zamani, Cupertino, CA (US); and Eric Day, San Jose, CA (US)
Assigned to Apple Inc., Cupertino, CA (US)
Appl. No. 18/246,024
Filed by Apple Inc., Cupertino, CA (US)
PCT Filed Sep. 10, 2021, PCT No. PCT/US2021/049744
§ 371(c)(1), (2) Date Mar. 20, 2023,
PCT Pub. No. WO2022/066426, PCT Pub. Date Mar. 31, 2022.
Claims priority of provisional application 63/083,794, filed on Sep. 25, 2020.
Prior Publication US 2023/0360660 A1, Nov. 9, 2023
Int. Cl. G10L 19/24 (2013.01); G10L 19/008 (2013.01)
CPC G10L 19/24 (2013.01) [G10L 19/008 (2013.01)] 20 Claims
OG exemplary drawing
 
20. A system configured to decode audio content, the system comprising:
a memory configured to store instructions;
a processor coupled to the memory and configured to execute the instructions stored in the memory to:
receive frames of the audio content, the audio content being represented by a plurality of content types, the frames containing audio streams encoding the audio content using an adaptive number of scene elements in the plurality of content types;
process two consecutive frames containing the audio streams encoding the audio content using a different mixture of the adaptive number of the scene elements in the plurality of content types to generate decoded audio streams; and
generate crossfading of the decoded audio streams in the two consecutive frames based on a speaker configuration to drive a plurality of speakers.