US 12,462,829 B1
Audio resampling for media synchronization
Jordan Gilman, Portland, OR (US); Trevor Wells, Washougal, WA (US); and Max C. Denton, Portland, OR (US)
Assigned to Amazon Technologies, Inc., Reno, NV (US)
Filed by Amazon Technologies, Inc., Reno, NV (US)
Filed on Jun. 30, 2022, as Appl. No. 17/854,478.
Claims priority of provisional application 63/344,916, filed on May 23, 2022.
Int. Cl. G10L 21/04 (2013.01); G10L 19/00 (2013.01); H04N 21/233 (2011.01); H04N 21/2343 (2011.01); H04N 21/43 (2011.01)
CPC G10L 21/04 (2013.01) [G10L 19/00 (2013.01); H04N 21/2335 (2013.01); H04N 21/234336 (2013.01); G10H 2210/385 (2013.01); H04N 21/43072 (2020.08)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method, comprising:
receiving a media file to be transcoded, the media file including at least an audio portion and a video portion;
determining, based at least in part upon metadata associated with the media file, that a total duration of the audio portion is different from an expected duration;
determining, based at least in part upon the metadata, an average frame duration across a plurality of audio frames of the audio portion;
determining, for the plurality of audio frames, that individual frame durations for the plurality of audio frames fall within a permissible threshold of the average frame duration;
applying a resampling filter to the plurality of frames in order to apply a constant amount of duration adjustment across the plurality of audio frames, wherein the total duration of the resampled audio portion equals the expected duration; and
performing transcoding of the media file including the video portion and the resampled audio portion.