CPC G10L 13/027 (2013.01) [G06F 3/167 (2013.01); G10L 15/1815 (2013.01)] | 20 Claims |
1. A computer-implemented method comprising:
receiving, from a user device, first audio data;
performing speech processing using the first audio data to determine first intent data representing a request to receive first media content;
sending, to a first system component corresponding to the first media content, first data representing a request to send the first media content to the user device;
receiving, from the user device, a first identifier corresponding to the first media content;
sending, to a context storage, the first identifier;
receiving, from the user device, second audio data;
performing speech processing using the second audio data to determine second intent data representing a request to send a media comment;
in response to determining the second intent data, sending, to a second system component, second data representing a request to receive the media comment from the user device;
determining, using the first identifier, to send the media comment to a third system component associated with a creator of the first media content;
causing, by the second system component, the user device to output first synthesized speech representing a first indication to begin recording the media comment;
receiving, from the user device, third audio data representing the media comment;
receiving a second indication to send the media comment; and
sending, to the third system component based on the second indication, a notification that a new media comment is available.
|