CPC G06N 20/10 (2019.01) [G06F 18/214 (2023.01); G06F 18/217 (2023.01); G06F 18/2411 (2023.01); G06F 18/2451 (2023.01)] | 20 Claims |
1. An apparatus for applying data classification labels to a data object, the apparatus comprising at least one processor and at least one non-transitory memory including program code that with the at least one processor, cause the apparatus to:
retrieve one or more data objects from a data object repository, wherein the one or more data objects each comprise a data object identifier, an origin identifier, and one or more text based data elements;
parse the one or more text based data elements into a plurality of word based data elements;
generate a vector data object from the plurality of word based data elements, the vector data object comprising one or more vector data elements;
map the vector data object to a trained data classification vector data set to determine a data classification label for the vector data object, wherein the trained data classification vector data set is generated by training a data classification learning model with a labeled data object repository; and
update the labeled data object repository to associate the data classification label for the vector data object with the plurality of word based data elements, the data object identifier, and the origin identifier.
|