CPC H04N 21/44008 (2013.01) [G06F 9/451 (2018.02); G06F 16/74 (2019.01); G06F 16/787 (2019.01); G06T 7/70 (2017.01); G06V 10/225 (2022.01); G06V 10/95 (2022.01); G06V 20/41 (2022.01); G06V 20/46 (2022.01); G06V 40/28 (2022.01); H04N 21/2187 (2013.01); H04N 21/47217 (2013.01); H04N 21/84 (2013.01); H04N 21/8547 (2013.01); G06T 2207/10016 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01); G06T 2207/30196 (2013.01)] | 15 Claims |
1. An electronic device, comprising:
circuitry communicatively coupled to a display device, wherein the circuitry is configured to:
receive a first media stream that includes a video, wherein the video includes a signer that is one of an animated character or a person who uses a sign language to perform in the video;
detect, by application of a neural network model on frames of the video, hand signs associated with the sign language in the video;
determine a location of the signer in the video based on the detection of the hand signs, wherein
the determined location of the signer comprises image coordinates which correspond to corners of a rectangular region of the video that includes the signer;
receive a first user input regarding the determined location of the signer in the video;
extract a video portion from the rectangular region of the video based on the received first user input, wherein the video portion corresponds to the determined location of the signer in the video;
control a playback of the video on the display device; and
control the display device based on the playback to:
render a user interface (UI) element on the display device; and
display the extracted video portion inside the UI element.
|