| CPC H04S 7/30 (2013.01) [G06V 20/46 (2022.01); G11B 27/036 (2013.01); G06V 2201/10 (2022.01); H04S 2400/11 (2013.01)] | 18 Claims |

|
1. A computer-implemented method comprising:
creating, during content production, an audio object and metadata associated with the audio object based on a motion vector analysis of an object in one or more image frames in a video;
inserting, during the content production, the audio object and the metadata associated with the audio object into at least one of an audio encoder or a video encoder;
applying one or more spatial rules to generate immersive sound from mono sound based on a motion vector extracted from the one or more image frames; and
rendering, during content playback, the audio object, without image frame analysis, based on decoding the audio object and parsing the metadata associated with the audio object.
|