CPC G06T 3/0012 (2013.01) [G06F 18/214 (2023.01); G06N 3/045 (2023.01); G06N 3/08 (2013.01); G06T 3/0006 (2013.01); G06T 5/00 (2013.01); G06T 11/00 (2013.01); G06T 2207/20016 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01); G06T 2207/30201 (2013.01)] | 17 Claims |
1. A method for generating a stylized image, the method comprising:
receiving an input image;
encoding the input image using a variational autoencoder to obtain a latent vector by:
passing the received input image through a headless pyramid network to produce multiple levels of features maps at different sizes;
encoding, for each of the levels of features maps at different sizes, each level's respective feature map at the different size with a separate encoder of a plurality of encoders to produce a code, and
combining the encoded code of each level's respective feature map to obtain the latent vector;
providing the latent vector to a pre-trained generative adversarial network (GAN) model;
generating, by the pre-trained GAN model, a stylized image from the pre-trained GAN model, the generated stylized image being a cartoon style image of the input image; and
providing the stylized image as an output,
wherein the pre-trained GAN model includes a multi-path structure corresponding to two or more different attributes.
|