| CPC G10L 21/04 (2013.01) [G10L 19/00 (2013.01); H04N 21/2335 (2013.01); H04N 21/234336 (2013.01); G10H 2210/385 (2013.01); H04N 21/43072 (2020.08)] | 20 Claims |

|
1. A computer-implemented method, comprising:
receiving a media file to be transcoded, the media file including at least an audio portion and a video portion;
determining, based at least in part upon metadata associated with the media file, that a total duration of the audio portion is different from an expected duration;
determining, based at least in part upon the metadata, an average frame duration across a plurality of audio frames of the audio portion;
determining, for the plurality of audio frames, that individual frame durations for the plurality of audio frames fall within a permissible threshold of the average frame duration;
applying a resampling filter to the plurality of frames in order to apply a constant amount of duration adjustment across the plurality of audio frames, wherein the total duration of the resampled audio portion equals the expected duration; and
performing transcoding of the media file including the video portion and the resampled audio portion.
|
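The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the function names `plan_uniform_adjustment` and `resample_linear`, the 10% tolerance, and the use of plain linear interpolation in place of the claimed resampling filter are all assumptions made for illustration. Frame durations are taken as already parsed from the media file's metadata.

```python
import math
from statistics import mean


def plan_uniform_adjustment(frame_durations, expected_total, tolerance=0.10):
    """Return a constant stretch ratio, or None if uniform adjustment does not apply.

    frame_durations: per-frame durations from metadata (any consistent unit).
    expected_total:  the duration the audio portion is expected to have.
    tolerance:       hypothetical permissible deviation, as a fraction of the average.
    """
    total = sum(frame_durations)
    if math.isclose(total, expected_total):
        return None  # total duration already matches; nothing to adjust

    average = mean(frame_durations)
    # Every individual frame must lie within the permissible threshold of the average.
    if any(abs(d - average) > tolerance * average for d in frame_durations):
        return None

    # The same ratio is applied to every frame, so the adjustment is constant.
    return expected_total / total


def resample_linear(samples, ratio):
    """Stretch or shrink a sample sequence by `ratio` using linear interpolation.

    Linear interpolation is a stand-in here; a production resampler would
    normally use a band-limited filter.
    """
    if not samples:
        return []
    n_out = max(1, round(len(samples) * ratio))
    if len(samples) == 1 or n_out == 1:
        return [samples[0]] * n_out

    step = (len(samples) - 1) / (n_out - 1)
    out = []
    for i in range(n_out):
        pos = i * step
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out


# Example: four frames (milliseconds) totalling 80 ms, expected 84 ms.
durations_ms = [20, 21, 20, 19]
ratio = plan_uniform_adjustment(durations_ms, expected_total=84)
audio = [float(i) for i in range(80)]  # placeholder samples
resampled = resample_linear(audio, ratio) if ratio is not None else audio
```

In this sketch, a frame whose duration falls outside the tolerance causes `plan_uniform_adjustment` to return `None`, so the caller can choose another correction strategy. The resampled audio would then be passed to the transcoder together with the video portion.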