US 11,889,147 B2
	Display of signing video through an adjustable user interface (UI) element
Brant Candelore, Poway, CA (US); Adam Goldberg, Fairfax, VA (US); and Robert Blanchard, Escondido, CA (US)
Assigned to SONY GROUP CORPORATION, Tokyo (JP)
Filed by SONY GROUP CORPORATION, Tokyo (JP)
Filed on Nov. 4, 2021, as Appl. No. 17/453,553.
Prior Publication US 2023/0133869 A1, May 4, 2023
Int. Cl. H04N 21/44 (2011.01); G06F 9/451 (2018.01); G06F 16/74 (2019.01); G06V 20/40 (2022.01); G06V 10/22 (2022.01); H04N 21/2187 (2011.01); H04N 21/472 (2011.01); H04N 21/84 (2011.01); G06T 7/70 (2017.01); G06F 16/787 (2019.01); G06V 40/20 (2022.01); G06V 10/94 (2022.01); H04N 21/8547 (2011.01)

CPC H04N 21/44008 (2013.01) [G06F 9/451 (2018.02); G06F 16/74 (2019.01); G06F 16/787 (2019.01); G06T 7/70 (2017.01); G06V 10/225 (2022.01); G06V 10/95 (2022.01); G06V 20/41 (2022.01); G06V 20/46 (2022.01); G06V 40/28 (2022.01); H04N 21/2187 (2013.01); H04N 21/47217 (2013.01); H04N 21/84 (2013.01); H04N 21/8547 (2013.01); G06T 2207/10016 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01); G06T 2207/30196 (2013.01)]

15 Claims

1. An electronic device, comprising:

circuitry communicatively coupled to a display device, wherein the circuitry is configured to:

receive a first media stream that includes a video, wherein the video includes a signer that is one of an animated character or a person who uses a sign language to perform in the video;

detect, by application of a neural network model on frames of the video, hand signs associated with the sign language in the video;

determine a location of the signer in the video based on the detection of the hand signs, wherein

the determined location of the signer comprises image coordinates which correspond to corners of a rectangular region of the video that includes the signer;

receive a first user input regarding the determined location of the signer in the video;

extract a video portion from the rectangular region of the video based on the received first user input, wherein the video portion corresponds to the determined location of the signer in the video;

control a playback of the video on the display device; and

control the display device based on the playback to:

render a user interface (UI) element on the display device; and

display the extracted video portion inside the UI element.