US 12,326,879 B2
Compression of target variable for multi-class classification
Jacques Doan Huu, Montigny le Bretonneux (FR)
Assigned to SAP SE, Walldorf (DE)
Filed by SAP SE, Walldorf (DE)
Filed on Jul. 24, 2023, as Appl. No. 18/357,294.
Prior Publication US 2025/0036656 A1, Jan. 30, 2025
Int. Cl. G06F 16/28 (2019.01); G06F 18/2415 (2023.01)
CPC G06F 16/285 (2019.01) [G06F 18/2415 (2023.01)] 15 Claims
OG exemplary drawing
 
1. A system comprising:
a memory storing processor-executable program code; and
at least one processing unit to execute the processor-executable program code to cause the system to:
determine a dataset comprising a plurality of instances, each instance comprising a value of each of a plurality of input variables and of a target variable, where the values of the target variable comprise a plurality of categories;
determine two or more infrequent categories of the plurality of categories from the dataset;
train a classifier to output a value of the target variable based on instances of the dataset including the infrequent categories and on no other instances of the dataset;
determine two or more non-separable categories from the two or more infrequent categories based on an output of the trained classifier for each instance of the dataset including the infrequent categories;
change occurrences of the two or more non-separable categories within the dataset to a single category to generate a modified dataset; and
train a second classifier to output a value of the target variable based on the modified dataset.