CPC G06N 5/022 (2013.01) [G06F 16/9024 (2019.01); G06F 16/93 (2019.01); G06F 40/258 (2020.01); G06F 40/279 (2020.01); G06F 40/289 (2020.01); G06F 40/295 (2020.01); G06F 40/30 (2020.01); G06N 3/08 (2013.01)] | 17 Claims |
1. A processor-implemented method comprising the steps of:
receiving, via one or more hardware processors, (i) a device category associated with a device fabrication, (ii) one or more device fabrication knowledge documents associated with the device category from a device fabrication knowledge repository, wherein each device fabrication knowledge document comprises a plurality of document paragraphs, wherein each document paragraph comprises one or more paragraph sentences, and each paragraph sentence comprises a plurality of paragraph sentence words;
pre-processing, via the one or more hardware processors, each device fabrication knowledge document of the one or more device fabrication knowledge documents, to obtain: (i) a plurality of pre-processed document paragraphs, in a plain text format, (ii) a section header for each pre-processed document paragraph of the plurality of pre-processed document paragraphs, wherein each pre-processed document paragraph comprises a plurality of pre-processed paragraph sentences, and wherein each pre-processed paragraph sentence comprises a plurality of pre-processed paragraph sentence words;
identifying, via the one or more hardware processors, one or more fabrication procedure paragraphs out of the plurality of pre-processed document paragraphs, by a trained fabrication procedure paragraph classification model, using the section header for each pre-processed document paragraph of the plurality of pre-processed document paragraphs, wherein each pre-processed document paragraph of the plurality of pre-processed document paragraphs is a fabrication procedure paragraph, if the pre-processed document paragraph comprises information related to a fabrication procedure;
identifying, via the one or more hardware processors, one or more entities, for each pre-processed paragraph sentence of the plurality of pre-processed paragraph sentences associated with each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, by a trained named entity identification model, wherein the plurality of entities are associated with a plurality of predefined concepts related to the fabrication procedure;
identifying, via the one or more hardware processors, (i) one or more first predefined relations out of a first set of predefined relations, and (ii) one or more second predefined relations out of a second set of predefined relations, for each pre-processed paragraph sentence of the plurality of pre-processed paragraph sentences associated with each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, by (i) a trained relation identification model, and (ii) a set of predefined pattern expressions, respectively, using the one or more entities identified for each pre-processed paragraph sentence;
identifying, via the one or more hardware processors, a device fabrication procedure for each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, by a rule based unsupervised algorithm, using the one or more entities identified for each pre-processed paragraph sentence of the plurality of pre-processed paragraph sentences associated with each fabrication procedure paragraph, wherein the device fabrication procedure for each fabrication procedure paragraph, comprises a sequence of operations; and
generating, via the one or more hardware processors, an operation sequence knowledge graph for each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, using (i) the one or more entities identified for each pre-processed paragraph sentence of the plurality of pre-processed paragraph sentences associated with each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, (ii) (a) the one or more first predefined relations out of the first set of predefined relations, and (b) the one or more second predefined relations out of the second set of predefined relations, identified for each pre-processed paragraph sentence of the plurality of pre-processed paragraph sentences associated with each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, and (iii) the device fabrication procedure for each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, using a graph database tool.
|