CPC G06F 3/017 (2013.01) [G06V 40/20 (2022.01)] | 20 Claims |
1. A method comprising:
processing an input frame of a sequence of frames captured by a camera of a device to determine a location of at least one detected instance of a distinguishing anatomical feature in the input frame, the at least one detected instance of the distinguishing anatomical feature detected in the input frame being a non-hand anatomical feature;
defining, for at least a selected one of the at least one detected instance of the distinguishing anatomical feature, a virtual gesture-space based on the location of the selected one instance of the distinguishing anatomical feature, the virtual gesture-space being a shape defined within the input frame for detecting a gesture input;
processing only the virtual gesture-space that is the shape defined within each frame in the sequence of frames to detect and track at least one hand;
predicting, using information generated from detecting and tracking the at least one hand, a gesture class associated with the at least one hand; and
outputting the predicted gesture class associated with the at least one hand.
|
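The steps recited in claim 1 can be pictured as a small processing loop. What follows is a minimal sketch under stated assumptions, not the implementation described in the patent: the `Box` type, the `define_gesture_space` scaling rule (a rectangle centered on the detected feature, 3 times its width and 2 times its height, clipped to the frame), and the `detect_feature`, `detect_hand`, and `classify` callables are all hypothetical names introduced here for illustration. Frames are modeled as plain lists of pixel rows, and the anatomical feature is re-detected in every frame as a simplification.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence

Frame = List[List[int]]  # assumed representation: rows of pixel values


@dataclass(frozen=True)
class Box:
    """Axis-aligned rectangle in pixel coordinates (hypothetical helper type)."""
    x: int
    y: int
    w: int
    h: int


def define_gesture_space(feature: Box, frame_w: int, frame_h: int,
                         scale_w: float = 3.0, scale_h: float = 2.0) -> Box:
    """Assumed rule: a rectangle centered on the detected non-hand feature,
    scaled relative to the feature size and clipped to the frame bounds."""
    cx = feature.x + feature.w / 2
    cy = feature.y + feature.h / 2
    gw = feature.w * scale_w
    gh = feature.h * scale_h
    left = max(0, int(cx - gw / 2))
    top = max(0, int(cy - gh / 2))
    right = min(frame_w, int(cx + gw / 2))
    bottom = min(frame_h, int(cy + gh / 2))
    return Box(left, top, right - left, bottom - top)


def crop(frame: Frame, space: Box) -> Frame:
    """Return only the pixels inside the virtual gesture-space."""
    rows = frame[space.y:space.y + space.h]
    return [row[space.x:space.x + space.w] for row in rows]


def predict_gesture(
    frames: Sequence[Frame],
    detect_feature: Callable[[Frame], Optional[Box]],
    detect_hand: Callable[[Frame], Optional[Box]],
    classify: Callable[[List[Box]], str],
) -> str:
    """Run the claimed steps over a frame sequence using caller-supplied
    (hypothetical) detectors and classifier."""
    tracked: List[Box] = []
    for frame in frames:
        frame_h, frame_w = len(frame), len(frame[0])
        feature = detect_feature(frame)  # e.g. a face; never a hand
        if feature is None:
            continue
        space = define_gesture_space(feature, frame_w, frame_h)
        # Hand detection sees only the gesture-space, not the full frame.
        hand = detect_hand(crop(frame, space))
        if hand is not None:
            # Map the hand location back to full-frame coordinates.
            tracked.append(Box(hand.x + space.x, hand.y + space.y, hand.w, hand.h))
    return classify(tracked)
```

Passing the detectors in as callables keeps the sketch independent of any particular vision library. The point it illustrates is that the hand detector only ever receives the cropped gesture-space, which is what limits hand detection and tracking to the region defined around the non-hand feature.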