CPC G06F 16/213 (2019.01) [G06F 16/2282 (2019.01); G06F 16/2358 (2019.01); G06F 16/2462 (2019.01); G06F 18/2178 (2023.01); G06N 3/045 (2023.01)] | 20 Claims |
1. A computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by one or more processors to cause the one or more processors to:
receive a portion of identifying information for one or more components of a database;
generate one or more descriptions for the one or more components based at least in part on the portion of the identifying information for the one or more components;
input the one or more descriptions and create, read, update and delete operations data of the database to one or more machine learning models;
predict, using the one or more machine learning models, one or more data types associated with the one or more components, wherein the prediction is based at least in part on the one or more descriptions and the create, read, update and delete operations data;
wherein the predicting comprises:
extracting from the create, read, update and delete operations data counts of a number of one or more of data reads, data writes, data deletes and data updates over a given time period for the one or more components; and
determining, based at least in part on the counts, the one or more data types associated with the one or more components; and
wherein the program instructions further cause the one or more processors to train the one or more machine learning models with: (i) labeled training data comprising respective ones of a plurality of data types corresponding to respective ones of a plurality of database components and respective ones of a plurality of descriptions of the database components; and (ii) data comprising correspondence between the respective ones of the plurality of data types and frequency of create, read, update and delete operations.
|