| CPC G06T 11/60 (2013.01) [G06T 7/194 (2017.01); G06T 9/00 (2013.01); G06T 2207/20081 (2013.01)] | 7 Claims |

|
1. A method of generating an image, comprising:
obtaining an input description and an input image depicting a subject;
encoding the input description using a text encoder of an image generation model to obtain a text embedding;
encoding the input image using a subject encoder of the image generation model to obtain a subject embedding;
generating a guidance embedding by combining the subject embedding and the text embedding; and
generating an output image based on the guidance embedding using a diffusion model of the image generation model, wherein the output image depicts one or more aspects of the input image and the input description.
|