US 11,887,216 B2
	High resolution conditional face generation
Ratheesh Kalarot, San Jose, CA (US); Timothy M. Converse, San Francisco, CA (US); Shabnam Ghadar, Menlo Park, CA (US); John Thomas Nack, San Jose, CA (US); Jingwan Lu, Santa Clara, CA (US); Elya Shechtman, Seattle, WA (US); Baldo Faieta, San Francisco, CA (US); and Akhilesh Kumar, San Jose, CA (US)
Assigned to ADOBE, INC., San Jose, CA (US)
Filed by ADOBE INC., San Jose, CA (US)
Filed on Nov. 19, 2021, as Appl. No. 17/455,796.
Prior Publication US 2023/0162407 A1, May 25, 2023
Int. Cl. G06T 5/00 (2006.01); G06T 11/00 (2006.01); G06N 3/08 (2023.01); G06V 40/16 (2022.01)

CPC G06T 11/00 (2013.01) [G06N 3/08 (2013.01); G06V 40/168 (2022.01); G06V 40/172 (2022.01)]

20 Claims

1. A method for image processing, comprising:

receiving an input image of a face comprising a plurality of input attributes and a plurality of input facial landmarks;

encoding the image to obtain a joint conditional vector representing the plurality of input attributes and the plurality of input facial landmarks using a joint embedding component, wherein the joint conditional vector is an input conditional vector to a mapping component of a generative adversarial network, and wherein the joint embedding component is trained to encode the plurality of input attributes and the plurality of input facial landmarks using an attribute loss based on a one-dimensional attribute vector with a plurality of values corresponding to the plurality of input attributes, respectively, and a landmark loss based on a landmark vector;

generating a latent vector based on the joint conditional vector and an identity vector using the mapping component of the generative adversarial network; and

generating a modified image based on the latent vector using the generative adversarial network.