US 12,430,812 B2
Text-guided cameo generation
Arnab Ghosh, Oxford (GB); Jian Ren, Marina Del Ray, CA (US); Pavel Savchenkov, London (GB); and Sergey Tulyakov, Marina del Rey, CA (US)
Assigned to Snap Inc., Santa Monica, CA (US)
Filed by Snap Inc., Santa Monica, CA (US)
Filed on Sep. 22, 2022, as Appl. No. 17/950,945.
Prior Publication US 2024/0104789 A1, Mar. 28, 2024
Int. Cl. G06T 11/00 (2006.01); G06F 40/289 (2020.01); G06F 40/35 (2020.01); G06V 40/16 (2022.01)
CPC G06T 11/00 (2013.01) [G06F 40/289 (2020.01); G06F 40/35 (2020.01); G06V 40/161 (2022.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method of generating an image including an existing representation of a face of a person, for use in a conversation taking place in a messaging application, the method comprising:
receiving conversation input text from a user of a portable device that includes a display;
generating model input text from the conversation input text;
generating an image based on the model input text using a text-to-image model;
determining coordinates of a face in the generated image;
applying the existing representation of the face of the person to the generated image based on the coordinates of the face in the generated image, to generate an updated image including the existing representation of the face of the person;
displaying the updated image on the display of the portable device;
receiving user input to transmit the updated image in a message; and
transmitting, in response to receiving the user input, the updated image to a remote recipient.