US 12,437,758 B2
	Display apparatus and a voice control method
Bingqing Wang, Shandong (CN); Wei Gao, Shandong (CN); Shuo Yu, Shandong (CN); Junhou Jiang, Shandong (CN); Yazhou Jia, Shandong (CN); Guohua Yue, Shandong (CN); Xinpei Zhu, Shandong (CN); Dejin Chu, Shandong (CN); Baocheng Li, Shandong (CN); Jialin Li, Shandong (CN); and Hanyong Wu, Shandong (CN)
Assigned to HISENSE VISUAL TECHNOLOGY CO., LTD., Shandong (CN)
Filed by HISENSE VISUAL TECHNOLOGY CO., LTD., Shandong (CN)
Filed on Feb. 15, 2023, as Appl. No. 18/169,313.
Application 18/169,313 is a continuation of application No. PCT/CN2021/119212, filed on Sep. 18, 2021.
Claims priority of application No. 202011268427.1 (CN), filed on Nov. 13, 2020; application No. 202110842951.3 (CN), filed on Jul. 26, 2021; and application No. 202110843767.0 (CN), filed on Jul. 26, 2021.
Prior Publication US 2023/0197082 A1, Jun. 22, 2023
Int. Cl. G10L 15/18 (2013.01); G06F 40/18 (2020.01); G06V 30/10 (2022.01); G10L 15/08 (2006.01); G10L 15/22 (2006.01)

CPC G10L 15/22 (2013.01) [G06V 30/10 (2022.01); G10L 15/08 (2013.01)]

18 Claims

1. A display apparatus, comprising:

a display, configured to display an image from a broadcast system or a network, and/or a user interface;

a detector, configured to acquire voice information from a user; and

a controller, in connection with the display and the detector and configured to:

display a user interface on the display;

obtain the voice information input from the user while the user interface is displaying on the display;

in response to the voice information, extract at least one keyword from the voice information, wherein the at least one keyword comprises a name content for indicating a controlled object and an action content for indicating an execution action;

traverse action items in a configuration library, wherein controlled objects of the action items in the configuration library are configured according to applications built-in the display apparatus; in response to determining that no action item in the configuration library matches the at least one keyword, obtain text information of the user interface on the display, and obtain layout information of the user interface; extract a function control in a layout of the user interface according to the text information, wherein the function control is a control having a first text presented on the display and matched with the at least one keyword; and generate a control instruction according to the function control and the voice information;

in response to determining that a first action item in the configuration library matches the at least one keyword, cause the display apparatus to execute the first action item;

wherein the controller is further configured to:

traverse positions of all controls in the layout information of the user interface;

calculate a distance between a position of a second text in the text content in an image of the user interface and a position of a second control among the controls in the layout information of the user interface; and

in response to determining that the distance is less than or equal to a preset distance threshold, mark the second control corresponding to the distance as the function control.