US 12,333,253 B2
Automatic data domain identification
Malolan Chetlur, Jakkur (IN); Arvind Agarwal, New Delhi (IN); Subhendu Dey, Kolkata (IN); Sameep Mehta, Bangalore (IN); and Sandipan Sarkar, Kolkata (IN)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by International Business Machines Corporation, Armonk, NY (US)
Filed on Nov. 18, 2021, as Appl. No. 17/529,899.
Prior Publication US 2023/0153537 A1, May 18, 2023
Int. Cl. G06F 40/30 (2020.01); G06F 9/445 (2018.01); G06F 9/455 (2018.01); G06F 12/14 (2006.01); G06F 40/242 (2020.01); G06N 20/00 (2019.01); H04L 9/40 (2022.01)
CPC G06F 40/30 (2020.01) [G06F 40/242 (2020.01)] 20 Claims
OG exemplary drawing
 
1. An apparatus, comprising:
at least one processing device comprising a processor coupled to a memory, the at least one processing device, when executing program code, is configured to:
extract one or more entities identified in a plurality of data artifacts based at least in part on one or more datasets;
extract one or more entities identified in a plurality of code artifacts based at least in part on the one or more datasets;
extract one or more entities identified in a plurality of user interface artifacts based at least in part on the one or more datasets;
generate a set of dependency graphs each based at least in part on one or more relationships among the respective extracted one or more entities; and
perform one or more of a lexical analysis and a semantic analysis on the set of dependency graphs to identify a data domain of the one or more datasets.