US 12,008,330 B2
Apparatus and method for augmenting textual data
Na Un Kang, Seoul (KR); Geon Yi, Seoul (KR); Min Young Lee, Seoul (KR); and Min Soo Kim, Seoul (KR)
Assigned to SAMSUNG SDS CO., LTD., Seoul (KR)
Filed by SAMSUNG SDS CO., LTD., Seoul (KR)
Filed on Oct. 26, 2021, as Appl. No. 17/510,640.
Claims priority of application No. 10-2020-0139566 (KR), filed on Oct. 26, 2020.
Prior Publication US 2022/0129644 A1, Apr. 28, 2022
Int. Cl. G06F 40/40 (2020.01); G06F 18/21 (2023.01); G06F 40/284 (2020.01)
CPC G06F 40/40 (2020.01) [G06F 18/2193 (2023.01); G06F 40/284 (2020.01)] 22 Claims
OG exemplary drawing
 
1. An apparatus for augmenting textual data, the apparatus comprising:
a data augmenter configured to generate augmented data by augmenting input textual data according to a data augmentation scheme decided based on a type of natural language processing task of the input textual data; and
a data classifier configured to classify the augmented data into a positive sample or a negative sample by determining whether or not the augmented data maintains label information of the input textual data based on one or more data classification criteria,
wherein the data classifier comprises at least one of:
a first analyzer configured to decide whether or not the augmented data is the positive sample or the negative sample using a mapping table preset according to the data augmentation scheme and the type of natural language processing task of the input textual data;
a second analyzer configured to analyze whether or not the augmented data satisfies grammar to decide whether or not the augmented data is the positive sample or the negative sample; or
a third analyzer configured to compare a predicted value of user input label with a label of the augmented data to decide whether or not the augmented data is the positive sample or the negative sample.