CPC G06F 40/40 (2020.01) [G06F 40/109 (2020.01); G06F 40/284 (2020.01)] | 23 Claims |
14. A method for processing natural language, the method performed by an apparatus comprising a collection module, a preprocessing module and a first machine learning module, the method comprising:
in the collection module, an operation of collecting a document having style information on text in the document, the style information indicating a style that has been applied to the text in the document;
in the preprocessing module, an operation of extracting the style information from the text of the collected document, and labeling the text with a position of a portion of the text to which the extracted style information has been applied; and
in the first machine learning module, an operation of receiving the text labeled with the position of the portion of the text to which the extracted style information has been applied, and being trained, by using the labeled text as learning data, to predict a position of a word having the style information in the received text.
|