CPC G06N 7/01 (2023.01) [G06F 3/0653 (2013.01)] | 20 Claims |
1. A method, comprising:
accessing a dataset;
selecting a list of attributes of the dataset, each of the attributes being selected based on a determination that the attribute is affecting growth of the dataset and affecting an amount of data storage space consumed by the dataset;
assigning a SHAP score to each attribute;
using the SHAP scores to assign respective weights to each attribute;
deriving drift and shock information for the dataset, wherein the drift and shock information is derived from the SHAP scores;
based on the drift and shock information, calculating a risk score that a storage capacity of an asset where the dataset is stored will be exhausted within a particular time interval; and
using the risk score as a basis to identify, and implement, an action to reduce a risk that the storage capacity of the asset will be exhausted within the particular time interval.
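The steps recited in claim 1 can be illustrated with a minimal sketch. The claim does not specify any formulas, so everything below is an assumption made for illustration only: the SHAP scores are taken as precomputed inputs, weights are the normalized absolute scores, "drift" is modeled as the weighted mean growth per interval, "shock" as the weighted standard deviation of that growth, and the risk score as a normal-approximation probability that cumulative growth over the interval exceeds the free capacity. All function names, thresholds, and action labels are hypothetical.

```python
import math


def weights_from_shap(shap_scores):
    """Normalize absolute SHAP scores into weights that sum to 1 (assumed scheme)."""
    total = sum(abs(s) for s in shap_scores.values())
    if total == 0:
        # No attribute stands out: fall back to equal weights.
        return {name: 1.0 / len(shap_scores) for name in shap_scores}
    return {name: abs(s) / total for name, s in shap_scores.items()}


def drift_and_shock(growth_history, weights):
    """Return (drift, shock) from per-attribute growth series.

    Assumed definitions: drift is the weighted mean growth per interval,
    shock is the square root of the weighted population variance.
    """
    drift = 0.0
    variance = 0.0
    for name, series in growth_history.items():
        mean = sum(series) / len(series)
        var = sum((x - mean) ** 2 for x in series) / len(series)
        drift += weights[name] * mean
        variance += weights[name] * var
    return drift, math.sqrt(variance)


def risk_score(drift, shock, free_capacity, horizon):
    """Probability that growth over `horizon` intervals exceeds `free_capacity`.

    Assumes cumulative growth is roughly normal with mean horizon * drift
    and standard deviation shock * sqrt(horizon).
    """
    expected = horizon * drift
    if shock == 0:
        return 1.0 if expected >= free_capacity else 0.0
    z = (free_capacity - expected) / (shock * math.sqrt(horizon))
    return 0.5 * math.erfc(z / math.sqrt(2))


def choose_action(risk):
    """Map a risk score to a hypothetical remediation action."""
    if risk >= 0.8:
        return "expand capacity"
    if risk >= 0.4:
        return "archive or deduplicate"
    return "no action"


# Usage with made-up numbers (GB of growth per interval for each attribute).
shap_scores = {"file_count": 3.0, "avg_file_size": 1.0}
growth_history = {"file_count": [10, 12, 8], "avg_file_size": [2, 2, 2]}

weights = weights_from_shap(shap_scores)
drift, shock = drift_and_shock(growth_history, weights)
risk = risk_score(drift, shock, free_capacity=50, horizon=5)
print(weights, drift, shock, risk, choose_action(risk))
```

A threshold-based action mapping is used here only because it is the simplest way to show the risk score driving a decision; the claim itself leaves the choice of action open.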