US 12,411,556 B2
Using gestures to control a media player
Michael Cutter, Golden, CO (US); Sunil Ramesh, Cupertino, CA (US); and Karina Levitian, Austin, TX (US)
Assigned to Roku, Inc., San Jose, CA (US)
Filed by Roku, Inc., San Jose, CA (US)
Filed on Sep. 19, 2024, as Appl. No. 18/889,616.
Application 18/889,616 is a continuation of application No. 18/510,953, filed on Nov. 16, 2023, granted, now 12,124,635.
Application 18/510,953 is a continuation of application No. 17/973,150, filed on Oct. 25, 2022, granted, now 11,868,538, issued on Jan. 9, 2024.
Prior Publication US 2025/0013312 A1, Jan. 9, 2025
Int. Cl. G06F 3/01 (2006.01); G06T 7/70 (2017.01); G06V 10/764 (2022.01); G06V 20/52 (2022.01); G06V 40/20 (2022.01); H04N 23/90 (2023.01)
CPC G06F 3/017 (2013.01) [G06T 7/70 (2017.01); G06V 10/764 (2022.01); G06V 20/52 (2022.01); G06V 40/20 (2022.01); H04N 23/90 (2023.01); G06T 2207/30196 (2013.01); G06T 2207/30232 (2013.01); G06T 2207/30244 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A computing system configured for performing a set of acts comprising:
receiving, from an input device associated with the computing system, a command to map a customized gesture with a particular action of a plurality of actions that a media player is configured to perform;
in response to receiving the command, monitoring, using a first camera, a viewing environment of the media player to detect performance by a person of the customized gesture, wherein the first camera is mounted at a first location within the viewing environment of the media player;
determining training data of the person within the viewing environment of the media player performing a particular gesture;
based on the determined training data and further based on pose data specifying a known orientation of the first camera, generating a classification for use by the computing system for detecting the particular gesture using a second camera, wherein the second camera is mounted at a second location different from the first location and exterior to the viewing environment of the media player;
monitoring an environment of the second camera to detect the particular gesture;
correlating the detected particular gesture with a particular corresponding action of the plurality of actions; and
controlling the media player to perform the correlated particular corresponding action.