CPC G10L 13/10 (2013.01) [G06F 40/30 (2020.01); G10L 13/06 (2013.01); G10L 25/18 (2013.01); G10L 13/047 (2013.01); G10L 2013/105 (2013.01)] | 20 Claims |
1. A voice generating method, comprising:
acquiring a text to be processed, and determining an associated text of the text to be processed, wherein the associated text is a context text of the text to be processed;
acquiring an associated prosodic feature of the associated text;
determining an associated text feature of the associated text based on the text to be processed, wherein the associated text feature comprises a semantic information feature of the associated text;
determining a spectrum feature to be processed of the text to be processed based on the associated prosodic feature and the associated text feature; and
generating a target voice corresponding to the text to be processed based on the spectrum feature to be processed.
|