| CPC G06F 3/167 (2013.01) [G06F 3/0482 (2013.01); G06F 3/04847 (2013.01); G06F 3/04883 (2013.01); G10L 15/22 (2013.01); G10L 2015/223 (2013.01)] | 20 Claims |

|
1. A method implemented by one or more processors, the method comprising:
receiving, via a computing device, a spoken utterance intended for controlling a GUI element; and
in response to receiving the spoken utterance:
accessing, by an automated assistant accessible via the computing device, content description data that characterizes GUI elements displayed via a display of the computing device, wherein the content description data is not displayed via the display,
comparing, by the automated assistant, one or more terms in the spoken utterance to the content description data that characterizes the GUI elements displayed at the display, to identify whether the one or more terms in the spoken utterance are associated with any GUI element from the GUI elements displayed at the display, and
in response to identifying that at least a portion of the one or more terms in the spoken utterance is associated with a particular GUI element from the GUI elements displayed at the display:
controlling, by the automated assistant, the particular GUI element in response to the spoken utterance.
|
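
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation, only one possible reading under stated assumptions: `GuiElement`, `match_element`, and `handle_utterance` are hypothetical names, the spoken utterance is assumed to be already transcribed to text, and simple word overlap stands in for whatever comparison the automated assistant actually performs.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass
class GuiElement:
    """Hypothetical stand-in for a GUI element shown on the display."""
    element_id: str
    # Non-displayed metadata that characterizes the element (assumed plain text).
    content_description: str
    # Hypothetical action invoked when the assistant controls the element.
    activate: Callable[[], str]


def match_element(utterance: str, elements: Sequence[GuiElement]) -> Optional[GuiElement]:
    """Compare utterance terms to each content description.

    Returns the element sharing the most terms with the utterance,
    or None when no element shares any term. Word overlap is an
    assumption made for illustration only.
    """
    terms = set(utterance.lower().split())
    best: Optional[GuiElement] = None
    best_overlap = 0
    for element in elements:
        description_terms = set(element.content_description.lower().split())
        overlap = len(terms & description_terms)
        if overlap > best_overlap:
            best, best_overlap = element, overlap
    return best


def handle_utterance(utterance: str, elements: Sequence[GuiElement]) -> Optional[str]:
    """Control the matched element, if any, in response to the utterance."""
    element = match_element(utterance, elements)
    if element is None:
        return None  # no term is associated with any displayed element
    return element.activate()
```

In this sketch, an utterance such as "turn up the volume" would select an element whose content description is "volume slider", while an utterance sharing no terms with any description leaves every element untouched.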