US 11,734,580 B2
Building analytical platform to enable device fabrication
Deepak Jain, Pune (IN); Sapankumar Hiteshchandra Shah, Pune (IN); Beena Rai, Pune (IN); Pritwish Mitra, Pune (IN); Sreedhar Reddy, Pune (IN); Neelanshi Wadhwa, Pune (IN); and Sarath Sasidharan, Pune (IN)
Assigned to TATA CONSULTANCY SERVICES LIMITED, Mumbai (IN)
Filed by Tata Consultancy Services Limited, Mumbai (IN)
Filed on May 18, 2021, as Appl. No. 17/323,760.
Claims priority of application No. 202021021081 (IN), filed on May 19, 2020.
Prior Publication US 2021/0365634 A1, Nov. 25, 2021
Int. Cl. G06N 5/022 (2023.01); G06F 16/93 (2019.01); G06F 16/901 (2019.01); G06F 40/289 (2020.01); G06N 3/08 (2023.01); G06F 40/258 (2020.01); G06F 40/295 (2020.01); G06F 40/30 (2020.01); G06F 40/279 (2020.01)
CPC G06N 5/022 (2013.01) [G06F 16/9024 (2019.01); G06F 16/93 (2019.01); G06F 40/258 (2020.01); G06F 40/279 (2020.01); G06F 40/289 (2020.01); G06F 40/295 (2020.01); G06F 40/30 (2020.01); G06N 3/08 (2013.01)] 17 Claims
OG exemplary drawing
 
1. A processor-implemented method comprising the steps of:
receiving, via one or more hardware processors, (i) a device category associated with a device fabrication, (ii) one or more device fabrication knowledge documents associated with the device category from a device fabrication knowledge repository, wherein each device fabrication knowledge document comprises a plurality of document paragraphs, wherein each document paragraph comprises one or more paragraph sentences, and each paragraph sentence comprises a plurality of paragraph sentence words;
pre-processing, via the one or more hardware processors, each device fabrication knowledge document of the one or more device fabrication knowledge documents, to obtain: (i) a plurality of pre-processed document paragraphs, in a plain text format, (ii) a section header for each pre-processed document paragraph of the plurality of pre-processed document paragraphs, wherein each pre-processed document paragraph comprises a plurality of pre-processed paragraph sentences, and wherein each pre-processed paragraph sentence comprises a plurality of pre-processed paragraph sentence words;
identifying, via the one or more hardware processors, one or more fabrication procedure paragraphs out of the plurality of pre-processed document paragraphs, by a trained fabrication procedure paragraph classification model, using the section header for each pre-processed document paragraph of the plurality of pre-processed document paragraphs, wherein each pre-processed document paragraph of the plurality of pre-processed document paragraphs is a fabrication procedure paragraph, if the pre-processed document paragraph comprises information related to a fabrication procedure;
identifying, via the one or more hardware processors, one or more entities, for each pre-processed paragraph sentence of the plurality of pre-processed paragraph sentences associated with each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, by a trained named entity identification model, wherein the plurality of entities are associated with a plurality of predefined concepts related to the fabrication procedure;
identifying, via the one or more hardware processors, (i) one or more first predefined relations out of a first set of predefined relations, and (ii) one or more second predefined relations out of a second set of predefined relations, for each pre-processed paragraph sentence of the plurality of pre-processed paragraph sentences associated with each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, by (i) a trained relation identification model, and (ii) a set of predefined pattern expressions, respectively, using the one or more entities identified for each pre-processed paragraph sentence;
identifying, via the one or more hardware processors, a device fabrication procedure for each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, by a rule based unsupervised algorithm, using the one or more entities identified for each pre-processed paragraph sentence of the plurality of pre-processed paragraph sentences associated with each fabrication procedure paragraph, wherein the device fabrication procedure for each fabrication procedure paragraph, comprises a sequence of operations; and
generating, via the one or more hardware processors, an operation sequence knowledge graph for each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, using (i) the one or more entities identified for each pre-processed paragraph sentence of the plurality of pre-processed paragraph sentences associated with each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, (ii) (a) the one or more first predefined relations out of the first set of predefined relations, and (b) the one or more second predefined relations out of the second set of predefined relations, identified for each pre-processed paragraph sentence of the plurality of pre-processed paragraph sentences associated with each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, and (iii) the device fabrication procedure for each fabrication procedure paragraph of the one or more fabrication procedure paragraphs, using a graph database tool.