CPC H04N 21/234363 (2013.01) [G06T 9/00 (2013.01); G06V 10/24 (2022.01); G06V 10/25 (2022.01); G06V 10/82 (2022.01); H04N 7/0117 (2013.01)] | 16 Claims |
1. A processor, comprising:
one or more circuits to:
align a plurality of images into frames of a first video using a neural network model comprising a latent diffusion model (LDM), wherein the first video has a first spatial resolution, the LDM comprises:
an encoder to map an input from an image space to a latent space; and
a decoder to map latent encoding from the latent space to the image space; and
generate a second video having a second spatial resolution by up-sampling the first video using an up-sampler neural network model, wherein the second spatial resolution is higher than the first spatial resolution, wherein the decoder is updated according to one or more temporal incoherencies in mapping the latent encoding from the latent space to the image space.
|