CPC G06N 3/08 (2013.01) [G06F 40/30 (2020.01); G06N 3/045 (2023.01)] | 18 Claims |
1. A computing device comprising:
memory storing one or more instructions; and
at least one processor configured to execute the one or more instructions stored in the memory to cause the at least one processor to:
identify a classification system of a category of data comprising a classification criterion of the category of the data and a plurality of keywords;
obtain data comprising at least one sentence; and
determine at least one category with respect to the at least one sentence of the data based on the classification system of the category of the data using a neural network configured to perform classification by unsupervised learning,
wherein the one or more instructions, when executed to determine the at least one category with respect to the at least one sentence of the data, cause the at least one processor to:
generate first modification data in which words excluding words matching the plurality of keywords are removed from first training data;
generate second modification data in which morphologically similar words of the words of the first modification data extend using an edit distance algorithm; and
based on word embedding, generate second training data comprising a word vector in which semantically similar words of words of the second modification data extend.
|
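The three data-preparation steps recited in claim 1 (keyword filtering, morphological extension by edit distance, and semantic extension by word embedding) can be illustrated with a minimal sketch. It is not the claimed implementation. The function names, the toy vocabulary, the two-dimensional embedding table, and the thresholds `max_distance` and `min_similarity` are all assumptions introduced for illustration. Levenshtein distance stands in for the "edit distance algorithm", and cosine similarity stands in for the unspecified measure of semantic similarity.

```python
import math

def edit_distance(a, b):
    """Levenshtein distance, computed row by row with dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]

def filter_by_keywords(sentence, keywords):
    """First modification data: keep only words that match a keyword."""
    return [w for w in sentence.lower().split() if w in keywords]

def expand_morphological(words, vocabulary, max_distance=1):
    """Second modification data: add vocabulary words within max_distance edits."""
    expanded = list(words)
    for w in words:
        for cand in vocabulary:
            if cand not in expanded and edit_distance(w, cand) <= max_distance:
                expanded.append(cand)
    return expanded

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0

def expand_semantic(words, embeddings, min_similarity=0.9):
    """Second training data: word vectors for the words plus their near neighbours."""
    expanded = list(words)
    for w in words:
        if w not in embeddings:
            continue
        for cand, vec in embeddings.items():
            if cand not in expanded and cosine(embeddings[w], vec) >= min_similarity:
                expanded.append(cand)
    return {w: embeddings[w] for w in expanded if w in embeddings}

# Hypothetical inputs, chosen only to exercise each step.
keywords = {"refund", "payment"}
sentence = "Please process my refund after the payment failed"
vocabulary = ["refunds", "payments", "repayment", "delivery"]
embeddings = {
    "refund": [1.0, 0.0], "refunds": [1.0, 0.05], "reimbursement": [0.9, 0.1],
    "payment": [0.0, 1.0], "payments": [0.05, 1.0], "delivery": [0.7, 0.7],
}

first_modification = filter_by_keywords(sentence, keywords)
second_modification = expand_morphological(first_modification, vocabulary)
second_training = expand_semantic(second_modification, embeddings)
print(first_modification)
print(second_modification)
print(sorted(second_training))
```

In this sketch the word list grows at each stage: the keyword filter narrows the sentence to category-bearing words, the edit-distance step adds spelling variants such as plurals, and the embedding step adds words that are close in vector space even when they are spelled differently. A practical system would presumably draw the vocabulary and embeddings from a trained model rather than from hand-written tables.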