US 12,266,147 B2
	Hand posture estimation method, apparatus, device, and computer storage medium
Yang Zhou, Palo Alto, CA (US)
Assigned to GUANGDONG OPPO MOBILE TELECOMMUNICATIONS CORP., LTD., Guangdong (CN)
Filed by GUANGDONG OPPO MOBILE TELECOMMUNICATIONS CORP., LTD., Guangdong (CN)
Filed on May 18, 2022, as Appl. No. 17/747,837.
Application 17/747,837 is a continuation of application No. PCT/CN2020/122933, filed on Oct. 22, 2020.
Claims priority of provisional application 62/938,190, filed on Nov. 20, 2019.
Prior Publication US 2022/0358326 A1, Nov. 10, 2022
Int. Cl. G06V 10/44 (2022.01); G06T 7/73 (2017.01); G06V 10/82 (2022.01); G06V 40/10 (2022.01)

CPC G06V 10/44 (2022.01) [G06T 7/73 (2017.01); G06V 10/82 (2022.01); G06V 40/11 (2022.01); G06T 2207/20084 (2013.01); G06T 2207/30196 (2013.01)]

18 Claims

1. A hand posture estimation method, comprising:

obtaining an initial feature map corresponding to a hand region in a candidate image;

obtaining a fused feature map by performing feature fusion processing on the initial feature map; wherein the feature fusion processing is configured to fuse features around a plurality of key points;

obtaining a target feature map by performing deconvolution processing on the fused feature map; wherein the deconvolution processing is configured to adjust a resolution of the fused feature map; and

obtaining coordinate information of the plurality of key points based on the target feature map to determine a posture estimation result of the hand region in the candidate image;

wherein the obtaining a fused feature map by performing feature fusion processing on the initial feature map comprises:

obtaining a first feature map by performing a first convolution processing on the initial feature map through a first convolutional network; wherein the first convolution processing is configured to extract local detail information of the plurality of key points; and

wherein before the performing a first convolution processing on the initial feature map through a first convolutional network, the method further comprises:

obtaining a dimensionality-reduced feature map by performing dimensionality reduction processing on the initial feature map; wherein the dimensionality reduction processing is configured to reduce the number of channels of the initial feature map; and

obtaining the first feature map by performing the first convolution processing on the dimensionality-reduced feature map through the first convolutional network.