US 12,106,230 B2
Implementing relation linking for knowledge bases
Nandana Mihindukulasooriya, Cambridge, MA (US); Gaetano Rossiello, Brooklyn, NY (US); Alfio Massimiliano Gliozzo, Brooklyn, NY (US); Pavan Kapanipathi Bangalore, Westchester, NY (US); and Salim Roukos, Redondo Beach, CA (US)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by International Business Machines Corporation, Armonk, NY (US)
Filed on Oct. 23, 2020, as Appl. No. 17/079,202.
Prior Publication US 2022/0129770 A1, Apr. 28, 2022
Int. Cl. G06N 5/04 (2023.01); G06F 16/33 (2019.01); G06N 20/00 (2019.01)
CPC G06N 5/04 (2013.01) [G06F 16/334 (2019.01); G06N 20/00 (2019.01)] 16 Claims
OG exemplary drawing
 
1. A computer-implemented method, comprising:
identifying a natural language query;
translating the natural language query into an intermediate representation;
converting the intermediate representation into one or more query triples having two levels of information; and
performing, for each of the one or more query triples, relation linking between the respective query triple and a plurality of knowledge base triples;
wherein the intermediate representation includes a graph, where: nodes in the graphs represents concepts, and edges within the graph represent relationships between concept nodes;
wherein performing the relation linking includes inputting the one or more query triples into a machine learning environment;
wherein the machine learning environment outputs, for the one or more query triples, a plurality of relationships of the respective query triple with knowledge base triples, as well as scores indicative of a strength of similarity for the respective relationship;
training the machine learning environment using a distant supervision dataset, including:
identifying a knowledge base, the knowledge base including a plurality of natural language sentences and a plurality of knowledge base triples,
for individual ones of the plurality of knowledge base triples, selecting one of the plurality of natural language sentences within the knowledge base as a corresponding natural language sentence for the knowledge base triple, and
training the machine learning environment, utilizing the plurality of knowledge base triples and their corresponding natural language sentences;
determining an answer to the natural language query using the scores and the trained machine learning environment; and
outputting the answer for displaying on a display.