CPC H04N 21/8547 (2013.01) [H04N 21/233 (2013.01); H04N 21/23418 (2013.01); H04N 21/242 (2013.01)] | 20 Claims |
1. A method comprising:
performing, using a video classifier, video reference point classification of a video stream based on an audio-video dataset; wherein the audio-video dataset comprises audio-video data of objects causing a Doppler-effect sound in a corresponding audio stream;
performing, using an audio classifier, audio reference point classification of the corresponding audio stream based on the audio-video dataset;
identifying, based on the video reference point classification, a set of video segments comprising object related reference points in the video stream; wherein the object related reference points in the video stream include the Doppler-effect sound in the audio stream;
identifying, based on the audio reference point classification, a set of audio segments comprising object related reference points of the Doppler-effect sound in the audio stream;
correlating object related reference points in the video segments of the video stream and in the audio segments of the audio stream to identify a set of audio-video synchronization candidates; and
comparing context of the set of the audio-video synchronization candidates to identify an audio-video synchronization candidate to synchronize the audio stream and the video stream.
|