| CPC G06T 7/251 (2017.01) [G06T 7/75 (2017.01); G06V 40/28 (2022.01); G06T 2207/20081 (2013.01); G06T 2207/30196 (2013.01)] | 19 Claims |

|
1. A computing system for hand tracking, the system comprising:
one or more processors; and
one or more non-transitory computer-readable media that collectively store instructions that, when executed by the one or more processors, cause the one or more processors to perform operations, the operations comprising:
obtaining a first image frame, wherein the first image frame is descriptive of a hand comprising a palm;
processing the first image frame with a machine-learned palm detection model to generate one or more oriented bounding boxes associated with a position of the palm, wherein the position of the palm is determined with the machine-learned palm detection model based on one or more features in the first image frame, wherein the one or more oriented bounding boxes are descriptive of an orientation of at least one of the palm or the hand;
processing the one or more oriented bounding boxes with a machine-learned hand landmark model to determine a first plurality of hand landmark positions within the first image frame based at least in part on the one or more oriented bounding boxes;
generating data indicative of a hand skeleton corresponding to a first palm detected in the first image frame based at least in part on three-dimensional coordinates corresponding to the first plurality of hand landmark positions within the first image frame;
determining a set of finger states associated with the hand skeleton based at least in part on an accumulated angle of joints associated with each finger of the hand skeleton; and
determining whether the first image frame is associated with one or more of a plurality of gestures based at least in part on mapping the set of finger states to a set of pre-defined gestures.
|
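The last two operations of claim 1 (finger states from an accumulated joint angle, then mapping those states to pre-defined gestures) can be illustrated with a minimal sketch. The claim does not fix a landmark layout, an angle threshold, or a gesture set, so everything named here is an assumption made for illustration only: the 21-point layout with the wrist at index 0, the `FINGERS` index table, the `BENT_THRESHOLD` value, the two-state "extended"/"bent" labelling, and the entries of the `GESTURES` table. The sketch takes the three-dimensional landmark coordinates as given and does not model the palm detection or hand landmark models.

```python
import math

# Hypothetical landmark layout (an assumption, not taken from the claim):
# index 0 is the wrist; each finger has four consecutive landmarks, base to tip.
FINGERS = {
    "thumb": (1, 2, 3, 4),
    "index": (5, 6, 7, 8),
    "middle": (9, 10, 11, 12),
    "ring": (13, 14, 15, 16),
    "pinky": (17, 18, 19, 20),
}

# Hypothetical threshold: total bend (in radians) above which a finger is "bent".
BENT_THRESHOLD = math.pi / 2

# Hypothetical gesture table: set of extended fingers -> gesture label.
GESTURES = {
    frozenset(): "fist",
    frozenset({"index"}): "point",
    frozenset({"index", "middle"}): "victory",
    frozenset(FINGERS): "open_palm",
}


def _angle(u, v):
    """Angle in radians between two 3-D vectors (0.0 if either has zero length)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    # Clamp to guard against rounding slightly outside [-1, 1].
    return math.acos(max(-1.0, min(1.0, dot / (norm_u * norm_v))))


def accumulated_angle(landmarks, chain):
    """Sum the bend angles at each joint along wrist -> base -> ... -> tip."""
    points = [landmarks[0]] + [landmarks[i] for i in chain]
    total = 0.0
    for a, b, c in zip(points, points[1:], points[2:]):
        incoming = tuple(q - p for p, q in zip(a, b))
        outgoing = tuple(q - p for p, q in zip(b, c))
        total += _angle(incoming, outgoing)
    return total


def finger_states(landmarks):
    """Label each finger 'extended' or 'bent' from its accumulated joint angle."""
    return {
        name: "bent" if accumulated_angle(landmarks, chain) > BENT_THRESHOLD else "extended"
        for name, chain in FINGERS.items()
    }


def classify_gesture(landmarks):
    """Map the set of finger states to a pre-defined gesture, or None if unmatched."""
    states = finger_states(landmarks)
    extended = frozenset(name for name, state in states.items() if state == "extended")
    return GESTURES.get(extended)
```

Here `classify_gesture` expects a sequence of 21 `(x, y, z)` tuples and returns `None` when the finger states match no table entry, which corresponds to a frame associated with none of the pre-defined gestures. Summing the angles along the whole finger, rather than thresholding a single joint, is one plausible reading of "accumulated angle of joints"; other readings are possible.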