| CPC G06F 40/279 (2020.01) [G06F 40/30 (2020.01); G06F 40/40 (2020.01); G06N 5/022 (2013.01)] | 20 Claims |

|
1. A method for providing reasonable language model learning for text data in a computing system by a processor, comprising:
analyzing content from a plurality of data sources and a plurality of triples from a knowledge graph;
generating training data having a plurality of candidate labels derived from the analyzed content, each candidate label associated with a corresponding triple of the plurality of triples from the knowledge graph, the association of a first candidate label with a first triple based on semantic similarity between content from of a first data source and a keyword from the first triple;
training, by a convex continuous relation model, one or more reasonable language models based on the training data; and
generating text data by the trained reasonable language models using the plurality of triples.
|