US 11,954,150 B2
Electronic device and method for controlling the electronic device thereof
Jungmin Lee, Suwon-si (KR)
Assigned to SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR)
Filed by SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR)
Filed on Mar. 29, 2019, as Appl. No. 16/369,142.
Claims priority of application No. 10-2018-0046072 (KR), filed on Apr. 20, 2018.
Prior Publication US 2019/0325224 A1, Oct. 24, 2019
Int. Cl. G10L 15/22 (2006.01); G06F 16/33 (2019.01); G06F 16/332 (2019.01); G06F 16/73 (2019.01); G06F 16/738 (2019.01); G06F 16/9032 (2019.01); G06N 20/00 (2019.01); G06V 10/22 (2022.01); G06V 10/764 (2022.01); G06V 10/82 (2022.01); G06V 20/20 (2022.01); G06V 20/40 (2022.01); G10L 17/22 (2013.01)

CPC G06F 16/73 (2019.01) [G06F 16/3329 (2019.01); G06F 16/3344 (2019.01); G06F 16/738 (2019.01); G06F 16/90332 (2019.01); G06N 20/00 (2019.01); G06V 10/235 (2022.01); G06V 10/764 (2022.01); G06V 10/82 (2022.01); G06V 20/20 (2022.01); G06V 20/40 (2022.01); G06V 20/41 (2022.01); G10L 15/22 (2013.01); G10L 17/22 (2013.01); G10L 2015/223 (2013.01)]

22 Claims

1. A method for controlling an electronic device comprising:

displaying a video;

storing a plurality of frames of the reproduced video for a first time period while displaying the video;

receiving a user voice input of a user while displaying a first frame of the video, the user voice input comprising a request for information about an object displayed in the video;

based on the user voice input comprising the request for information about the object being received:

selecting a plurality of second frames based on a time point at which the user voice input is received, wherein the plurality of second frames are included in the stored plurality of frames, and the plurality of second frames were displayed by the electronic device during a second time period that is a predetermined amount of time prior to the time point when the user voice input is started to be received;

obtaining the information about the object based on a result of inputting the plurality of second frames into an artificial intelligence model trained through an artificial intelligence algorithm, wherein the information about the object comprises identification information of the object, position information of the object and size information of the object;

acquiring a keyword corresponding to the object based on the identification information of the object, the position information of the object, the size information of the object and the user voice input; and

providing a search result based on the keyword.
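
The frame storage, frame selection, and keyword steps of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation; it is one possible reading under stated assumptions. The names FrameBuffer, select_before, DetectedObject, and acquire_keyword are hypothetical, as are the retention and look-back durations. The "second time period" is read here as a fixed-length window ending at the moment the voice input starts. The artificial intelligence model is not shown: its output is assumed to be a list of objects carrying identification, position, and size information. The keyword heuristic (a left/right hint in the utterance, then the largest object) is purely illustrative.

```python
from collections import deque
from dataclasses import dataclass
from typing import Deque, List, Optional


@dataclass
class Frame:
    timestamp: float  # seconds since playback started
    image: object     # placeholder for pixel data


@dataclass
class DetectedObject:
    label: str    # identification information (assumed model output)
    x: float      # position: horizontal centre, normalised to 0..1
    y: float      # position: vertical centre, normalised to 0..1
    area: float   # size: fraction of the frame area


class FrameBuffer:
    """Hypothetical store keeping frames shown in the last `retention_s` seconds."""

    def __init__(self, retention_s: float) -> None:
        self.retention_s = retention_s
        self._frames: Deque[Frame] = deque()

    def store(self, frame: Frame) -> None:
        self._frames.append(frame)
        # Drop frames older than the retention window (the "first time period").
        while frame.timestamp - self._frames[0].timestamp > self.retention_s:
            self._frames.popleft()

    def select_before(self, voice_start: float, lookback_s: float) -> List[Frame]:
        # Stored frames displayed in [voice_start - lookback_s, voice_start],
        # one reading of the claimed "second time period".
        return [f for f in self._frames
                if voice_start - lookback_s <= f.timestamp <= voice_start]


def acquire_keyword(objects: List[DetectedObject], utterance: str) -> Optional[str]:
    """Illustrative heuristic combining identification, position, size, and voice."""
    words = utterance.lower().split()
    candidates = objects
    if "left" in words:
        candidates = [o for o in objects if o.x < 0.5]
    elif "right" in words:
        candidates = [o for o in objects if o.x >= 0.5]
    if not candidates:
        return None
    # Among the remaining candidates, prefer the largest object on screen.
    return max(candidates, key=lambda o: o.area).label
```

In this sketch the selection uses the time at which the voice input starts rather than the frame on screen when the request finishes, so an object that has already left the screen while the user is speaking can still be found in the selected frames.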