| CPC G10L 15/02 (2013.01) [G06N 3/08 (2013.01); G06N 5/02 (2013.01); G10L 25/30 (2013.01)] | 20 Claims |

|
1. A method executed by a controller for speech recognition in a system comprising:
parsing a plurality of candidate speech recognition results from a speech input from a user, wherein the parsing is conducted utilizing information associated with a personal knowledge graph corresponding to a user profile of the user and a separate domain-specific knowledge graph that includes a curated collection of background information;
extracting and disambiguating one or more entities from the plurality of candidate speech recognition results, based on natural language understanding (NLU) information utilizing both the personal knowledge graph and the separate domain-specific knowledge graph, wherein the separate domain-specific knowledge graph includes a baseline group and an attention group, wherein the separate domain-specific knowledge graph further includes an attention mechanism configured to, during training, impede overweighting of similar nodes in the baseline group and the attention group, wherein the baseline group integrate embeddings by calculating an average of terms in each dialogue of a subject, wherein disambiguating one or more entities includes linking the one or more entities via utilizing both classifiers and embeddings with a neural network;
selecting and retrieving data indicative of information of the one or more entities from a remote server as one or more relevant entities;
in response to the one or more relevant entities extracted and disambiguated from the speech input, updating the personal knowledge graph associated with the user profile of the user utilizing the data indicative of information of the one or more relevant entities; and
outputting a response to the speech input utilizing the user profile and the NLU information.
|