US 11,868,538 B1
Using gestures to control a media player
Michael Cutter, San Jose, CA (US); Sunil Ramesh, San Jose, CA (US); and Karina Levitian, San Jose, CA (US)
Assigned to Roku, Inc., San Jose, CA (US)
Filed by Roku, Inc., San Jose, CA (US)
Filed on Oct. 25, 2022, as Appl. No. 17/973,150.
Int. Cl. G06F 3/00 (2006.01); G06V 10/764 (2022.01); G06V 40/20 (2022.01); G06V 20/52 (2022.01); G06T 7/70 (2017.01); H04N 23/90 (2023.01); G06F 3/01 (2006.01)
CPC G06F 3/017 (2013.01) [G06T 7/70 (2017.01); G06V 10/764 (2022.01); G06V 20/52 (2022.01); G06V 40/20 (2022.01); H04N 23/90 (2023.01); G06T 2207/30196 (2013.01); G06T 2207/30232 (2013.01); G06T 2207/30244 (2013.01)] 17 Claims
OG exemplary drawing
 
1. A computing system configured for performing a set of acts comprising:
receiving, from an input device associated with the computing system, a command to map a customized gesture with a particular action of a plurality of actions that a media player is configured to perform;
in response to receiving the command, monitoring, using a camera, a viewing environment of the media player to detect performance by a person of the customized gesture;
in response to detecting performance of the customized gesture:
(i) generating a classification for use by the computing system for detecting the customized gesture, and
(ii) storing, in memory, mapping data that correlates the detected customized gesture with the particular action;
detecting the person within one or more images of the viewing environment captured by the camera;
in response to detecting the person, monitoring, using the camera, the viewing environment to detect performance by the person of a gesture;
correlating the detected gesture with a corresponding action of a plurality of actions that the media player is configured to perform, wherein the correlating the detected gesture with the corresponding action of the plurality of actions that the media player is configured to perform comprises:
(i) comparing the detected gesture with a library of known gestures,
(ii) determining that the detected gesture has at least a threshold degree of similarity to a particular gesture of the library of known gestures, and
(iii) in response to determining that the detected gesture has at least the threshold degree of similarity to the particular gesture of the library of known gestures, selecting the corresponding action based on mapping data that correlates each gesture of the library of known gestures to a respective one of the plurality of actions; and
controlling the media player to perform the corresponding action.