| CPC G06F 18/2415 (2023.01) | 23 Claims |

|
1. A method for data classification using clustering, comprising:
replacing a plurality of portions of metadata for a plurality of data objects with a plurality of replacement characters in order to generate a plurality of replaced strings;
clustering the plurality of data objects into a plurality of clusters based on commonalities between the plurality of replaced strings of data objects of the plurality of data objects such that data objects among the plurality of data objects having the same replaced strings among the plurality of replaced strings are grouped into the same clusters among the plurality of clusters;
classifying a subset of the data objects in each cluster into at least one class; and
aggregating classes within at least one cluster of the plurality of clusters, wherein aggregating classes within each of the at least one cluster includes applying the at least one class for the subset of the data objects in each cluster to each other data object within the cluster.
|