US 11,990,119 B1
User feedback for speech interactions
Gilles Jean Roger Belin, Seattle, WA (US); Charles S. Rogers, III, Seattle, WA (US); Robert David Owen, Sammamish, WA (US); Jeffrey Penrod Adams, Tyngsborough, MA (US); Rajiv Ramachandran, Seattle, WA (US); and Gregory Michael Hart, Mercer Island, WA (US)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Mar. 8, 2021, as Appl. No. 17/195,127.
Application 17/195,127 is a continuation of application No. 16/664,459, filed on Oct. 25, 2019, granted, now 10,950,220.
Application 16/664,459 is a continuation of application No. 15/925,397, filed on Mar. 19, 2018, granted, now 10,460,719, issued on Oct. 29, 2019.
Application 15/925,397 is a continuation of application No. 13/739,826, filed on Jan. 11, 2013, granted, now 9,922,639, issued on Mar. 20, 2018.
Int. Cl. G10L 15/22 (2006.01); G10L 15/00 (2013.01); G10L 15/07 (2013.01); G10L 15/26 (2006.01)
CPC G10L 15/00 (2013.01) [G10L 15/22 (2013.01); G10L 15/07 (2013.01); G10L 2015/221 (2013.01); G10L 15/26 (2013.01)]
16 Claims
1. A method comprising:
receiving, via one or more processors of a device, first input data associated with a first utterance of a user instructing the device to perform an action;
receiving, via the one or more processors of the device, output data to be output via an output component, the output data requesting feedback from the user based at least in part on the user instructing the device to perform the action;
receiving, via the one or more processors and at least partially in response to the output data, second input data associated with a second utterance of the user providing the feedback;
causing presentation of text on a display representing the second utterance of the user providing the feedback;
receiving, via an input component presented on the display along with the text, third input data representing an indication that the text presented on the display accurately represents the second utterance of the user providing the feedback;
based at least in part on the third input data, causing the second input data to update a model for speech processing;
receiving, via the one or more processors of the device, fourth input data corresponding to a third utterance of the user;
causing the fourth input data to undergo speech processing using the model as updated; and
receiving, based at least in part on the fourth input data and the model as updated, a response to the fourth input data.
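The sequence of steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. Every name in it (`SpeechModel`, `run_feedback_flow`, `transcribe`, `user_confirms`) is hypothetical, utterances are represented as plain strings instead of audio, and the "model" is a simple lookup table standing in for whatever speech-processing model an actual system would use.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class SpeechModel:
    """Hypothetical stand-in for a speech-processing model (assumption)."""
    confirmed: Dict[str, str] = field(default_factory=dict)

    def update(self, input_data: str, text: str) -> None:
        # Record a user-confirmed pairing of input data and its text.
        self.confirmed[input_data] = text

    def process(self, input_data: str) -> str:
        # Return the confirmed text if known, else a placeholder.
        return self.confirmed.get(input_data, "<unrecognized>")


def run_feedback_flow(
    model: SpeechModel,
    first_input: str,
    second_input: str,
    fourth_input: str,
    transcribe: Callable[[str], str],
    user_confirms: Callable[[str], bool],
) -> Tuple[List[str], str]:
    log: List[str] = []
    # First input data: utterance instructing the device to perform an action.
    log.append(f"action requested: {first_input}")
    # Output data requesting feedback from the user.
    log.append("output: requesting feedback")
    # Second input data: the feedback utterance, presented as text on a display.
    text = transcribe(second_input)
    log.append(f"display: {text}")
    # Third input data: indication that the displayed text is accurate.
    if user_confirms(text):
        # Only confirmed feedback is used to update the model.
        model.update(second_input, text)
        log.append("model updated")
    # Fourth input data: a later utterance processed with the model as updated.
    response = model.process(fourth_input)
    return log, response
```

In this sketch, the model is updated only when `user_confirms` returns true, mirroring the claim's condition that the update is based at least in part on the third input data.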