| CPC G06F 16/215 (2019.01) [G06F 16/2228 (2019.01); G06F 16/2365 (2019.01); G06F 16/258 (2019.01); G06F 16/285 (2019.01)] | 20 Claims |

|
1. A method for data management, integration, and interoperability, the method comprising:
defining, by a data integration engine, at least one data model and asset by including data models, vocabulary, data quality rules, data mapping rules for at least one of, a particular data industry, a data domain, or a data subject area;
importing, by the data integration engine, data from a plurality of data sources;
performing, by the data integration engine, de-duplication of the imported data;
performing, by the data integration engine, data profiling of the imported data;
creating, by the data integration engine, linked data by semantic mapping, wherein creating the linked data by semantic mapping includes:
learning to perform an annotation task using features extracted from at least one column statistics feature of a relational table;
compressing at least one other feature into a fixed-size embedding using a subnetwork; and
training a two-fully connected layer network on at least one embedding feature and at least one column statistics feature for predicting a column type annotation for dataset.
|