US 11,908,467 B1
Dynamic voice search transitioning
Rohit Prasad, Acton, MA (US); Anna Santos, Seattle, WA (US); David Sanchez, Seattle, WA (US); Jared Strawderman, Santa Clara, CA (US); Sarah Castle, Seattle, WA (US); Kerry Hammil, Seattle, WA (US); Christopher Schindler, Bainbridge Island, WA (US); Timothy Twerdahl, Los Altos, CA (US); Joseph Tavares, Kenmore, WA (US); and Bartosz Gulik, Issaquah, WA (US)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Aug. 24, 2020, as Appl. No. 17/000,886.
Application 17/000,886 is a continuation of application No. 14/869,496, filed on Sep. 29, 2015, granted, now 10,770,067.
Claims priority of provisional application 62/215,691, filed on Sep. 8, 2015.
Int. Cl. G10L 21/00 (2013.01); G10L 25/00 (2013.01); G10L 15/22 (2006.01); H04N 21/422 (2011.01); H04N 21/478 (2011.01); H04N 21/482 (2011.01)
CPC G10L 15/22 (2013.01) [H04N 21/42225 (2013.01); H04N 21/478 (2013.01); H04N 21/4828 (2013.01); G10L 2015/223 (2013.01)]
16 Claims
1. A device comprising:
memory that stores computer-executable instructions; and
at least one processor configured to access the memory and execute the computer-executable instructions to:
receive a first incoming voice data indication that is associated with a button interaction at a remote control;
receive first voice data from the remote control;
send the first voice data to a remote server for processing;
receive a set of content search results from the remote server;
cause presentation of the set of content search results;
determine a selection of first content from the set of content search results via the remote control;
cause presentation of first audio content and associated first conversation card by the device;
receive second voice data from the remote control;
determine that the second voice data is a request for second audio data and a second conversation card;
cause, based on the determination that the second voice data is a request for second audio data and a second conversation card, to stop the presentation of the first audio data;
cause presentation of the second audio data and the second conversation card;
determine, based on at least one of a predetermined time for presenting the second audio data or a completion of the second audio data, to return to the presentation of first audio data and first conversation card; and
cause continued presentation of the first conversation card and the first audio data,
wherein the first conversation card includes a template-based data structure for presenting information.
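
The interrupt-and-return behavior recited in claim 1 can be illustrated with a minimal Python sketch. It is offered only as one possible reading of the claim language, not as the patented implementation. All names here (ConversationCard, Presentation, Device, interrupt_with, maybe_return) are hypothetical and do not appear in the patent. The card is modeled, by assumption, as a template name plus named fields, and the return condition is modeled as "the predetermined time has elapsed or the second audio has completed".

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ConversationCard:
    """Hypothetical template-based card: a template name plus named fields."""
    template: str
    fields: dict = field(default_factory=dict)

    def render(self) -> str:
        # Fill the template's fields in a fixed (sorted) order.
        body = ", ".join(f"{k}={v}" for k, v in sorted(self.fields.items()))
        return f"[{self.template}] {body}"


@dataclass
class Presentation:
    """Audio paired with its associated conversation card."""
    audio: str
    card: ConversationCard


class Device:
    """Tracks the active presentation and the one to return to."""

    def __init__(self) -> None:
        self.current: Optional[Presentation] = None
        self.suspended: Optional[Presentation] = None
        self.log: List[str] = []

    def present(self, item: Presentation) -> None:
        self.current = item
        self.log.append(f"play {item.audio} | {item.card.render()}")

    def interrupt_with(self, item: Presentation) -> None:
        # Stop the first audio, remember it, then present the second item.
        if self.current is not None:
            self.log.append(f"stop {self.current.audio}")
            self.suspended = self.current
        self.present(item)

    def maybe_return(self, elapsed_s: float, limit_s: float, completed: bool) -> bool:
        # Return to the first presentation once the predetermined time
        # has elapsed or the second audio has completed.
        if self.suspended is None or not (completed or elapsed_s >= limit_s):
            return False
        resumed, self.suspended = self.suspended, None
        self.present(resumed)
        return True
```

In this sketch, a caller would invoke present for the first selection, interrupt_with when the second voice request is recognized, and maybe_return periodically until it reports that the first audio and card have been restored.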