US 11,841,899 B2
	Spatial audio file format for storing capture metadata
Jonathan D. Sheaffer, San Jose, CA (US); Symeon Delikaris Manias, Los Angeles, CA (US); Gaetan R. Lorho, Menlo Park, CA (US); Peter A. Raffensperger, Cupertino, CA (US); Eric A. Allamanche, Sunnyvale, CA (US); Frank Baumgarte, Sunnyvale, CA (US); Dipanjan Sen, Dublin, CA (US); Joshua D. Atkins, Los Angeles, CA (US); and Juha O. Merimaa, San Mateo, CA (US)
Assigned to Apple Inc., Cupertino, CA (US)
Filed by Apple Inc., Cupertino, CA (US)
Filed on Jun. 11, 2020, as Appl. No. 16/899,019.
Claims priority of provisional application 62/868,738, filed on Jun. 28, 2019.
Prior Publication US 2020/0409995 A1, Dec. 31, 2020
Int. Cl. G06F 16/683 (2019.01); G06F 16/174 (2019.01); H04R 1/40 (2006.01); H04R 3/00 (2006.01)

CPC G06F 16/683 (2019.01) [G06F 16/1744 (2019.01); H04R 1/406 (2013.01); H04R 3/005 (2013.01); H04R 2410/00 (2013.01)]

20 Claims

1. A method for processing audio comprising:

capturing, by a plurality of microphones of a capture device, a plurality of microphone signals;

generating metadata that is independent of the microphone signals, the metadata including;

an impulse response for each microphone of the plurality of microphones of the capture device, wherein each impulse response defines a response of an acoustic impulse between a sound source and a respective microphone, and

a distance between the sound source and the capture device;

compressing the metadata;

storing, in an electronic audio data file,

the microphone signals, and

the compressed metadata that is independent of the microphone signals; and

sending the electronic audio data file to a receiving device for the receiving device to use the impulse responses and the distance to spatially render the plurality of microphone signals.