CPC G06F 16/212 (2019.01) [G06F 40/295 (2020.01)] | 20 Claims |
1. A method of mapping a source schema of a first cloud based application to a target schema of a second cloud based application using a cloud based feature store, wherein the feature store comprises a central repository of features used for machine learning model training, the method comprising:
receiving a file comprising a plurality of source schema terms of the first application and a plurality of target schema terms of the second application, the file comprising a plurality of unmapped terms, wherein the source schema specifies how each of the source schema terms provides a first application name for each underlying data element of first application data stored in a first database, and the target schema specifies how each of the target schema terms provides a second application name for each underlying data element of second application data stored in a second database, wherein the underlying data elements stored in the first database are the same as the underlying data elements stored in the second database;
parsing the source schema and the target schema and extracting source schema terms and target schema terms that correspond to the underlying data elements;
retrieving rule based mappings for the unmapped terms between the source schema terms and the target schema terms and generating rule based matches between source schema terms and target schema terms;
based on semantic matching of the source schema terms, retrieving feature store based mappings from the feature store for the unmapped terms between the source schema terms and the target schema terms that were not matched using the rule based mappings, wherein the semantic matching comprises classifying named entities mentioned in unstructured text into pre-defined categories using a trained named entity recognition (NER) model;
generating one or more similarity scores for the feature store based mappings of the source schema terms to the target schema terms; and
generating possible feature store based mappings for the extracted source schema terms and target schema terms based on the similarity scores;
wherein the feature store comprises key-value pairs with keys corresponding to source schema terms and values corresponding to matched target schema terms;
transferring the underlying data elements from the first database to the second database and, based on the feature store based mappings, automatically transforming by the feature store the source schema terms to corresponding target schema terms for each of the underlying data elements, the transforming comprising converting source messages of the source schema into target messages of the target schema.
|
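The staged matching recited in claim 1 (rule based mappings first, then feature store based mappings ranked by similarity score, then transformation of source messages into target messages) might be sketched as follows. This is an illustrative sketch only, not the claimed implementation: the names `RULE_MAPPINGS`, `FEATURE_STORE`, `similarity`, `map_terms`, and `transform_message`, the sample schema terms, and the 0.6 threshold are all hypothetical. In particular, the character-level ratio from Python's `difflib` stands in for the semantic matching step, which the claim says uses a trained NER model that is not reproduced here.

```python
from difflib import SequenceMatcher

# Hypothetical rule based mappings: source term -> target term.
RULE_MAPPINGS = {"cust_id": "customerId"}

# Hypothetical feature store of key-value pairs: keys are source schema
# terms, values are the matched target schema terms.
FEATURE_STORE = {"fname": "firstName", "lname": "lastName", "zip": "postalCode"}


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]; a stand-in for semantic matching."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def map_terms(source_terms, target_terms, threshold=0.6):
    """Map source terms to target terms: rules first, then the feature store."""
    targets = set(target_terms)
    mappings, scores, unmapped = {}, {}, []
    for term in source_terms:
        # Stage 1: rule based match.
        rule_target = RULE_MAPPINGS.get(term)
        if rule_target in targets:
            mappings[term] = rule_target
            scores[term] = 1.0
            continue
        # Stage 2: score every feature store key against the unmatched term
        # and keep the best candidate if it clears the threshold.
        best_key = max(FEATURE_STORE, key=lambda k: similarity(term, k))
        best_score = similarity(term, best_key)
        candidate = FEATURE_STORE[best_key]
        if best_score >= threshold and candidate in targets:
            mappings[term] = candidate
            scores[term] = best_score
        else:
            unmapped.append(term)
    return mappings, scores, unmapped


def transform_message(message: dict, mappings: dict) -> dict:
    """Convert a source message into a target message by renaming mapped keys."""
    return {mappings[k]: v for k, v in message.items() if k in mappings}


source_terms = ["cust_id", "f_name", "zip", "colour"]
target_terms = ["customerId", "firstName", "postalCode"]
mappings, scores, unmapped = map_terms(source_terms, target_terms)
target_message = transform_message(
    {"cust_id": 7, "f_name": "Ada", "zip": "90210"}, mappings
)
```

In this sketch, terms that neither stage can match are returned in `unmapped` rather than being forced onto the nearest target term, and `transform_message` drops any key that has no mapping.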