CPC G11B 27/02 (2013.01) [G06N 3/045 (2023.01); G06N 3/08 (2013.01); G06T 9/002 (2013.01)] | 21 Claims |
1. An apparatus for inserting identity information from a source image of a first subject into a destination video of a second subject different than the first subject while mimicking motion in the destination video, said apparatus comprising:
an identity encoder configured to encode identity information from the source image and to produce an identity vector;
a driver encoder comprising a pose encoder configured to encode pose information from the destination video and to produce a pose vector, and a separate and independent motion encoder configured to encode motion information from the destination video and to produce a motion vector; and
a neural network generator having three inputs: the identity vector, the pose vector, and the motion vector; wherein
the neural network generator is configured to generate, in response to the three inputs, a composite video comprising identity information from the source video inserted into the destination video, where the composite video has substantially the same temporal information as in the destination video.
|