US 11,989,632 B2
Apparatuses, methods, and computer program products for programmatically parsing, classifying, and labeling data objects
Rupal Haribhakti, Cupertino, CA (US); and Aaron Gentleman, San Jose, CA (US)
Assigned to ATLASSIAN PTY LTD, Sydney (AU); and ATLASSIAN US, INC., San Francisco, CA (US)
Filed by ATLASSIAN PTY LTD, Sydney (AU); and ATLASSIAN US, INC., San Francisco, CA (US)
Filed on Dec. 30, 2020, as Appl. No. 17/138,110.
Prior Publication US 2022/0207429 A1, Jun. 30, 2022
Int. Cl. G06N 20/10 (2019.01); G06F 18/21 (2023.01); G06F 18/214 (2023.01); G06F 18/2411 (2023.01); G06F 18/2451 (2023.01)
CPC G06N 20/10 (2019.01) [G06F 18/214 (2023.01); G06F 18/217 (2023.01); G06F 18/2411 (2023.01); G06F 18/2451 (2023.01)] 20 Claims
OG exemplary drawing
 
1. An apparatus for applying data classification labels to a data object, the apparatus comprising at least one processor and at least one non-transitory memory including program code that with the at least one processor, cause the apparatus to:
retrieve one or more data objects from a data object repository, wherein the one or more data objects each comprise a data object identifier, an origin identifier, and one or more text based data elements;
parse the one or more text based data elements into a plurality of word based data elements;
generate a vector data object from the plurality of word based data elements, the vector data object comprising one or more vector data elements;
map the vector data object to a trained data classification vector data set to determine a data classification label for the vector data object, wherein the trained data classification vector data set is generated by training a data classification learning model with a labeled data object repository; and
update the labeled data object repository to associate the data classification label for the vector data object with the plurality of word based data elements, the data object identifier, and the origin identifier.