CPC G06F 40/211 (2020.01) [G06F 40/40 (2020.01)] | 7 Claims |
1. A word processing system, comprising:
a first generation unit which generates, based on sentence information including a plurality of sentences, hierarchy data indicating a syntax tree for each hierarchy with regard to each sentence;
a second generation unit which
acquires, from a plurality of hierarchy data generated by the first generation unit, hierarchy data of a second sentence similar to hierarchy data of a first sentence generated by the first generation unit,
extracts a difference between the hierarchy data of the first sentence and the hierarchy data of the second sentence, and
generates, as paraphrasing rule data, first expression data as a difference in the first sentence and second expression data as a difference in the second sentence; and
a storage unit which stores the paraphrasing rule data generated by the second generation unit in a storage unit,
wherein the plurality of sentences including the first sentence and the second sentence are in a same natural language,
wherein the first sentence and the second sentence are similar as paraphrases and are not identical,
wherein the first expression data consists of nodes in a syntax tree of the second sentence that are not present in a syntax tree of the first sentence,
wherein the second expression data consists of nodes in the syntax tree of the first sentence that are not present in the syntax tree of the second sentence, and
wherein the paraphrasing rule data converts the first expression data into the second expression data.
|