| CPC G06F 16/353 (2019.01) [G06F 16/3326 (2019.01); G06F 16/367 (2019.01)] | 8 Claims |

|
1. A computer-implemented method for data management, the method comprising:
obtaining a dataset from a data source, by a processing unit, wherein the dataset comprises a plurality of datapoints, and wherein each of the datapoints belong to a column among a plurality of columns;
predicting an ontology label for at least one column in the dataset using a machine learning model, wherein the predicted ontology label is associated with an ontology comprising a plurality of ontology labels;
generating a mapping between the dataset and the ontology based on the relation between the predicted ontology label and the column;
classifying the datapoints with respect to the ontology labels based on the mapping generated;
outputting the classified datasets on a user interface;
providing an option to a user on the user interface for validation of predicted ontology labels for a plurality of datasets by a user-input via the user interface;
if the user rejects the respective predicted ontology label, requesting the user to manually select the correct ontology label for the column from a list of ontology labels associated with the ontology for assigning the correct ontology label to the column;
teaching the machine learning model a relationship between the column and the assigned ontology label;
identifying a relation between at least another column and at least another ontology label from the plurality of ontology labels, based on a user-input received from the user interface; and
training the machine learning model based on the relation identified;
wherein identifying the relation between the at least another column and the at least another ontology label from the plurality of ontology labels based on the user-input received from the user interface includes receiving the user-input from a user via the user interface, wherein the user input corresponds to assigning the ontology label to the at least another column; determining one or more attributes associated with the at least another column based on the user-input; and determining the relation based on the one or more attributes associated with the at least another column and one or more properties associated with the ontology label.
|