US 12,443,633 B1
Determining device context
Xing Fan, Redmond, WA (US); Vasiliy Radostev, Seattle, WA (US); Jie Bao, Sammamish, WA (US); Muddu Krishna Chintha, Dublin, CA (US); Xiaojiang Huang, Bellevue, WA (US); Yi Luo, Issaquah, WA (US); Chenlei Guo, Redmond, WA (US); Nikko Strom, Kirkland, WA (US); Casey Stuart Smith, Bainbridge Island, WA (US); Spyridon Matsoukas, Hopkinton, MA (US); and Priti Bisaria, Los Gatos, CA (US)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on May 4, 2023, as Appl. No. 18/143,285.
Claims priority of provisional application 63/494,134, filed on Apr. 4, 2023.
Int. Cl. G06F 15/08 (2006.01); G06F 16/3329 (2025.01); G10L 15/08 (2006.01)
CPC G06F 16/3329 (2019.01) [G10L 15/08 (2013.01)]
18 Claims
 
1. A computer-implemented method comprising:
receiving, by a first system component, first context data representing a current interaction between a user and a first user device, the current interaction corresponding to a first type of activity;
receiving second context data representing first sensor data generated by the first user device;
processing the first context data and the second context data using a first neural network encoder to generate first embedding data representing a situational context of the user;
receiving, from a first data storage component, user profile data corresponding to the user;
receiving, from a second data storage component, first data representing factual information about the first type of activity;
processing the user profile data and the first data using a second neural network encoder to generate second embedding data;
processing the first embedding data and the second embedding data using a neural network decoder to generate fourth data representing a natural language description of the user's situational context;
receiving, from the first user device, first input data representing a first utterance of the user;
processing the first input data to determine first natural language understanding (NLU) data representing a user request;
processing the first NLU data and the fourth data to generate first response data; and
causing the first user device to output the first response data.
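
The data flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `toy_encoder`, `toy_decoder`, `toy_nlu`, `respond`, and `run_pipeline`, and all sample values, are hypothetical. The encoders and decoder are simple deterministic stand-ins for the neural network components, and the toy embedding keeps its source fields only so that the stand-in decoder can produce text.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Embedding:
    vector: List[float]
    fields: Dict[str, str]  # assumption: kept only so the toy decoder can emit text


def toy_encoder(*inputs: Dict[str, str], dim: int = 4) -> Embedding:
    """Stand-in for a neural network encoder (not a trained model)."""
    merged: Dict[str, str] = {}
    for item in inputs:
        merged.update(item)
    vector = [0.0] * dim
    # Deterministic character sum; a real encoder would learn this mapping.
    for key in sorted(merged):
        for i, ch in enumerate(key + merged[key]):
            vector[i % dim] += ord(ch) / 1000.0
    return Embedding(vector, merged)


def toy_decoder(situational: Embedding, background: Embedding) -> str:
    """Stand-in for the neural network decoder: returns a text description."""
    s, b = situational.fields, background.fields
    return (f"The user is {s['activity']} ({s['sensor']}). "
            f"Profile: {b['profile']}. Fact: {b['fact']}.")


def toy_nlu(utterance: str) -> Dict[str, str]:
    """Stand-in NLU: a keyword match instead of a trained model."""
    intent = "play_music" if "play" in utterance.lower() else "unknown"
    return {"intent": intent, "text": utterance}


def respond(nlu: Dict[str, str], description: str) -> str:
    """Combine the NLU data with the situational description."""
    return f"[{nlu['intent']}] given context: {description}"


def run_pipeline(utterance: str) -> str:
    interaction = {"activity": "cooking"}                   # first context data
    sensor = {"sensor": "kitchen motion detected"}          # second context data
    profile = {"profile": "prefers short answers"}          # user profile data
    fact = {"fact": "cooking often leaves hands occupied"}  # first data
    first_embedding = toy_encoder(interaction, sensor)
    second_embedding = toy_encoder(profile, fact)
    description = toy_decoder(first_embedding, second_embedding)  # fourth data
    return respond(toy_nlu(utterance), description)
```

Calling `run_pipeline("Play some jazz")` walks through the claimed steps in order: two encodings, one decoding into a natural language description, NLU on the utterance, and a response conditioned on both the NLU data and the description.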