CPC G06F 40/211 (2020.01) [G06F 40/129 (2020.01); G06F 40/268 (2020.01)] | 20 Claims |
1. An apparatus and a method for generating a word embedding library, comprising:
an input interface; and
a processor,
wherein the processor is configured to receive original text composed of Hangul through the input interface,
segment the original text by morpheme and combine segmented morphemes step-by-step according to a preset rule,
match a tag to a combination of step-by-step morphemes according to a morphological attribute or a syntactic attribute of the combination of step-by-step morphemes, and
generate a word embedding library by classifying the morphemes included in the original text based on the tag matched to the combination of step-by-step morphemes,
a display configured to output various results calculated in a process of generating the word embedding library, and
a memory configured to pre-store various data necessary for generating the word embedding library.
|