CPC G10L 15/07 (2013.01) [G10L 15/005 (2013.01); G10L 15/063 (2013.01); G10L 15/083 (2013.01); H04N 21/2335 (2013.01); H04N 21/23418 (2013.01); H04N 21/234345 (2013.01); H04N 21/8106 (2013.01)] | 20 Claims |
1. A method of processing an original video file to generate a modified video file, the modified video file including a translated audio content of the original video file, the method comprising:
receiving the original video file for processing;
receiving a second video file of a different speaker than the speaker in the original video file, wherein the second video file is a video of a different speaker stating speech expressions;
accessing a model associating facial characteristics of the speaker in the original video file with portions of speech for those portions of replaced audio content; and
replacing facial expressions of the speaker in the original video file with facial expressions according to the video of the different speaker, on determination of a facial expression of the speaker matching a facial expression in the model.
|