CPC H04N 9/646 (2013.01) [G06T 3/40 (2013.01); G06T 5/70 (2024.01); G06T 7/90 (2017.01); G06T 2207/10024 (2013.01); G06T 2207/20084 (2013.01); G06T 2207/20221 (2013.01)]
19 Claims
1. A computer-implemented method, comprising:
receiving and processing a first color image by an encoder,
wherein
the first color image comprises a first portion of the first color image and a second portion of the first color image located at different locations of the first color image; and
the encoder is configured to output at least one first feature map comprising fused global information and local information such that whether a color consistency relationship between the first portion of the first color image and the second portion of the first color image exists is encoded into the fused global information and local information, wherein the encoder comprises:
a first block;
a second block; and
a first skip connection,
wherein
the first block comprises:
a convolutional block configured to output at least one second feature map comprising local information and has a first receptive field; and
the second block comprises:
a global pooling layer configured to perform global pooling on the at least one second feature map, and output at least one third feature map comprising global information, and has a second receptive field wider than the first receptive field; and
an upscaling layer configured to upscale the at least one third feature map and output at least one fourth feature map having a same scale as the at least one second feature map and comprising the global information; and
the first skip connection is configured to fuse the at least one second feature map and the at least one fourth feature map, to generate the at least one first feature map, such that the at least one first feature map has a same number of channels as a number of channels of the at least one second feature map, wherein the fused global information and local information is obtained from the global information and the local information.
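The data flow recited in claim 1 can be illustrated with a minimal pure-Python sketch. It is not the patented implementation. The function names, the 3x3 kernel size, the ReLU, the use of average pooling, nearest-neighbour upscaling, and element-wise addition as the fusion step are all assumptions chosen for illustration. The claim itself only requires a convolutional block, global pooling, upscaling to the same scale, and a fusion that preserves the number of channels.

```python
# Illustrative sketch only. Feature maps are stored as [channel][row][col].
# Every name here is hypothetical, and so is the choice of addition for fusion.

def conv_block(image, kernels):
    """3x3 convolution (zero padding, stride 1) followed by ReLU.

    image:   C_in channels, each an H x W list of lists.
    kernels: C_out filters, each holding C_in 3x3 kernels.
    Each output value sees only a 3x3 neighbourhood (the narrower receptive field).
    """
    h, w = len(image[0]), len(image[0][0])
    out = []
    for filt in kernels:
        channel = [[0.0] * w for _ in range(h)]
        for y in range(h):
            for x in range(w):
                acc = 0.0
                for c, k in enumerate(filt):
                    for dy in (-1, 0, 1):
                        for dx in (-1, 0, 1):
                            yy, xx = y + dy, x + dx
                            if 0 <= yy < h and 0 <= xx < w:
                                acc += image[c][yy][xx] * k[dy + 1][dx + 1]
                channel[y][x] = max(acc, 0.0)
        out.append(channel)
    return out


def global_pool(fmaps):
    """Global average pooling: one 1x1 map per channel, covering the whole map."""
    return [[[sum(map(sum, ch)) / (len(ch) * len(ch[0]))]] for ch in fmaps]


def upscale(pooled, h, w):
    """Nearest-neighbour upscaling of each 1x1 map back to H x W."""
    return [[[ch[0][0]] * w for _ in range(h)] for ch in pooled]


def fuse(second, fourth):
    """Skip connection as element-wise addition, so the channel count is unchanged."""
    return [
        [[a + b for a, b in zip(row_s, row_f)] for row_s, row_f in zip(ch_s, ch_f)]
        for ch_s, ch_f in zip(second, fourth)
    ]


def encoder(image, kernels):
    second = conv_block(image, kernels)   # second feature maps: local information
    third = global_pool(second)           # third feature maps: global information
    fourth = upscale(third, len(second[0]), len(second[0][0]))  # same scale as second
    return fuse(second, fourth)           # first feature maps: fused information
```

Addition is used here because it trivially keeps the output channel count equal to that of the second feature maps. Channel concatenation would double the channel count unless followed by a projection, so it would need an extra step to meet the same condition.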