US 12,067,690 B2
	Image processing method and apparatus, device, and storage medium
Tianyu Sun, Shenzhen (CN); Haozhi Huang, Shenzhen (CN); and Wei Liu, Shenzhen (CN)
Assigned to TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED, Shenzhen (CN)
Filed by TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED, Shenzhen (CN)
Filed on Oct. 8, 2021, as Appl. No. 17/497,883.
Application 17/497,883 is a continuation of application No. PCT/CN2020/117455, filed on Sep. 24, 2020.
Claims priority of application No. 201911072470.8 (CN), filed on Nov. 5, 2019.
Prior Publication US 2022/0028031 A1, Jan. 27, 2022
Int. Cl. G06T 3/04 (2024.01); G06F 18/214 (2023.01); G06N 3/045 (2023.01); G06N 3/088 (2023.01); G06T 9/00 (2006.01); G06V 10/94 (2022.01); G06V 40/16 (2022.01)

CPC G06T 3/04 (2024.01) [G06F 18/214 (2023.01); G06N 3/045 (2023.01); G06N 3/088 (2013.01); G06T 9/00 (2013.01); G06V 10/95 (2022.01); G06V 40/168 (2022.01); G06V 40/174 (2022.01)]

17 Claims

1. An image processing method, applied to a computer device, the method comprising:

encoding an input image based on an attention mechanism to obtain an encoding tensor set and an attention map set of the input image, the encoding tensor set including n encoding tensors, the attention map set including n attention maps, and n being an integer greater than 1, comprising:

determining, for each group of a corresponding encoding tensor and a corresponding attention map in the encoding tensor set and the attention map set, a processed encoding tensor based on the encoding tensor and the attention map, to obtain n processed encoding tensors;

obtaining an encoding result of the input image according to the encoding tensor set and the attention map set, the encoding result of the input image recording an identity feature of a human face in the input image;

encoding an expression image to obtain an encoding result of the expression image, the encoding result of the expression image recording an expression feature of a human face in the expression image; and

generating an output image according to the encoding result of the input image and the encoding result of the expression image, the output image having the identity feature of the input image and the expression feature of the expression image, the encoding result of the expression image including n displacement maps, comprising:

performing spatial transformation processing, for each group of a corresponding processed encoding tensor and a corresponding displacement map, on the processed encoding tensor to obtain n transformed encoding tensors; and

decoding the n transformed encoding tensors to generate the output image.