| CPC G06F 16/3329 (2019.01) [G10L 15/08 (2013.01)] | 18 Claims |

|
1. A computer-implemented method comprising:
receiving, by a first system component, first context data representing a current interaction between a user and a first user device, the current interaction corresponding to a first type of activity;
receiving second context data representing first sensor data generated by the first user device;
processing the first context data and the second context data using a first neural network encoder to generate first embedding data representing a situational context of the user;
receiving, from a first data storage component, user profile data corresponding to the user;
receiving, from a second data storage component, first data representing factual information about the first type of activity;
processing the user profile data and the first data using a second neural network encoder to generate second embedding data;
processing the first embedding data and the second embedding data using a neural network decoder to generate fourth data representing a natural language description of the user's situational context;
receiving, from the first user device, first input data representing a first utterance of the user;
processing the first input data to determine first natural language understanding (NLU) data representing a user request;
determining, using the first NLU data and the fourth data, first response data; and
causing the first user device to output the first response data.
|
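The data flow recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation: the neural network encoders, the decoder, and the NLU component are replaced by toy stand-ins (a hash-based vector, a text template, and a keyword match), and every name below (`encode`, `decode`, `understand`, `respond`, `handle_turn`, `EMBED_DIM`) is hypothetical. Only the ordering of the steps and which data feeds which component follow the claim.

```python
import hashlib
from dataclasses import dataclass

EMBED_DIM = 8  # hypothetical embedding size, chosen only for illustration


@dataclass
class Embedding:
    vector: list   # toy numeric embedding
    source: tuple  # raw inputs, kept so the toy decoder can produce text


def encode(*inputs: str) -> Embedding:
    """Stand-in for a neural network encoder (hash-based, not learned)."""
    digest = hashlib.sha256("|".join(inputs).encode("utf-8")).digest()
    return Embedding([b / 255 for b in digest[:EMBED_DIM]], inputs)


def decode(first: Embedding, second: Embedding) -> str:
    """Stand-in for the neural network decoder: fills a text template."""
    interaction, sensor = first.source
    profile, facts = second.source
    return (f"user ({profile}) is engaged in: {interaction}; "
            f"sensors report: {sensor}; background: {facts}")


def understand(utterance: str) -> dict:
    """Stand-in for NLU processing: keyword-based intent detection."""
    intent = "ask_duration" if "how long" in utterance.lower() else "unknown"
    return {"intent": intent, "text": utterance}


def respond(nlu: dict, description: str) -> str:
    """Build response data from the NLU data and the context description."""
    if nlu["intent"] == "unknown":
        return "Sorry, I did not understand that."
    return f"Answering '{nlu['text']}' given context: {description}"


def handle_turn(interaction, sensor, profile, facts, utterance) -> str:
    # First encoder: interaction context + sensor context -> first embedding
    first_embedding = encode(interaction, sensor)
    # Second encoder: user profile + activity facts -> second embedding
    second_embedding = encode(profile, facts)
    # Decoder: both embeddings -> natural language description ("fourth data")
    description = decode(first_embedding, second_embedding)
    # NLU on the utterance, then response from NLU data + description
    nlu = understand(utterance)
    return respond(nlu, description)


if __name__ == "__main__":
    print(handle_turn(
        "following a recipe in a cooking app",
        "kitchen motion detected",
        "prefers vegetarian meals",
        "recipes list ingredients and steps",
        "How long should this simmer?",
    ))
```

In this sketch the natural language description is computed before the utterance is processed, mirroring the order of the claim steps; a real system would substitute trained models for each stand-in.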