| CPC H04N 21/439 (2013.01) [G06F 3/165 (2013.01); G06F 3/167 (2013.01); G10L 15/26 (2013.01); H04N 21/233 (2013.01); H04N 21/234336 (2013.01); H04N 21/2541 (2013.01); H04N 21/4108 (2013.01); H04N 21/4122 (2013.01); H04N 21/4126 (2013.01); H04N 21/42203 (2013.01); H04N 21/437 (2013.01); H04N 21/4415 (2013.01); H04N 21/4751 (2013.01); H04N 21/64322 (2013.01); H04N 21/6547 (2013.01); H04N 21/8113 (2013.01); H04N 21/8586 (2013.01); G10L 25/51 (2013.01)] | 20 Claims |

|
1. A computer-implemented method, comprising:
receiving first data representing an utterance;
receiving second data representing audio;
processing the second data to determine third data corresponding to media content;
sending the third data to a speech recognition component; and
processing the first data and the third data by the speech recognition component to determine a first action to be performed.
|