US 12,277,738 B2
Method and system for latent-space facial feature editing in deep learning based face swapping
Sirak Ghebremusse, San Francisco, CA (US); Stéphane Grabli, San Francisco, CA (US); Jacek Krzysztof Naruniec, Zürich (CH); Romann Matthew Weber, Zürich (CH); and Christopher Richard Schroers, Zürich (CH)
Assigned to Lucasfilm Entertainment Company Ltd. LLC, San Francisco, CA (US); and Disney Enterprises, Inc., Burbank, CA (US)
Filed by LUCASFILM ENTERTAINMENT COMPANY LTD. LLC, San Francisco, CA (US); and DISNEY ENTERPRISES, INC, Burbank, CA (US)
Filed on Mar. 29, 2022, as Appl. No. 17/707,782.
Prior Publication US 2023/0316587 A1, Oct. 5, 2023
Int. Cl. G06T 9/00 (2006.01); G06T 7/70 (2017.01); G06T 11/00 (2006.01); G06V 40/16 (2022.01)
CPC G06T 9/002 (2013.01) [G06T 7/70 (2017.01); G06T 11/00 (2013.01); G06V 40/168 (2022.01); G06T 2200/24 (2013.01); G06T 2207/10016 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01); G06T 2207/20092 (2013.01); G06T 2207/30201 (2013.01)] 22 Claims
OG exemplary drawing
 
1. A computer-implemented method of changing a face within an image or video frame, the method comprising:
receiving an input image that includes a face presenting a facial expression in a pose;
processing the image with a neural network encoder to generate a latent space point that is an encoded representation of the image;
decoding the latent space point to generate an initial output image in accordance with a desired facial identity but with the facial expression and pose of the face in the input image;
identifying a feature of the facial expression in the initial output image to edit;
responsive to identifying the feature of the facial expression in the initial output image to edit, applying an adjustment vector to a latent space point corresponding to the initial output image to generate an adjusted latent space point, wherein applying the adjustment vector to the latent space point comprises translating the latent space point in latent space by adding the adjustment vector to the latent space point, wherein the adjusted latent space point comprises the latent space point after it has been translated; and
decoding the adjusted latent space point to generate an adjusted output image in accordance with the desired facial identity but with the facial expression and pose of the face in the input image altered in accordance with the adjustment vector.