US 12,073,822 B2
Voice generating method and apparatus, electronic device and storage medium
Xinyong Zhou, Beijing (CN); Junteng Zhang, Beijing (CN); Tao Sun, Beijing (CN); and Lei Jia, Beijing (CN)
Assigned to BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD., Beijing (CN)
Filed by BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD., Beijing (CN)
Filed on Dec. 21, 2022, as Appl. No. 18/086,004.
Claims priority of application No. 202111593297.3 (CN), filed on Dec. 23, 2021.
Prior Publication US 2023/0131494 A1, Apr. 27, 2023
Int. Cl. G10L 13/00 (2006.01); G06F 40/30 (2020.01); G10L 13/06 (2013.01); G10L 13/10 (2013.01); G10L 25/18 (2013.01); G10L 13/047 (2013.01)
CPC G10L 13/10 (2013.01) [G06F 40/30 (2020.01); G10L 13/06 (2013.01); G10L 25/18 (2013.01); G10L 13/047 (2013.01); G10L 2013/105 (2013.01)]
20 Claims
 
1. A voice generating method, comprising:
acquiring a text to be processed, and determining an associated text of the text to be processed, wherein the associated text is a context text of the text to be processed;
acquiring an associated prosodic feature of the associated text;
determining an associated text feature of the associated text based on the text to be processed, wherein the associated text feature comprises a semantic information feature of the associated text;
determining a spectrum feature to be processed of the text to be processed based on the associated prosodic feature and the associated text feature; and
generating a target voice corresponding to the text to be processed based on the spectrum feature to be processed.
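
The data flow recited in claim 1 can be illustrated with a minimal Python sketch. This is not the patented implementation: every function name, feature definition, and parameter below (`associated_text`, `prosodic_feature`, `text_feature`, `spectrum_feature`, `vocoder`, `n_bins`, `samples_per_frame`) is a hypothetical stand-in, and the toy arithmetic replaces what would in practice be trained encoders, an acoustic model, and a vocoder. The sketch only shows which inputs feed which step.

```python
import math
from typing import List, Sequence


def associated_text(sentences: Sequence[str], i: int) -> str:
    """Context text of sentences[i]; assumed here to be the preceding sentence."""
    return sentences[i - 1] if i > 0 else ""


def prosodic_feature(text: str) -> List[float]:
    """Toy stand-in for a prosody encoder: [emphasis-like, rate-like] scalars."""
    words = text.split()
    if not words:
        return [0.0, 0.0]
    emphasis = (text.count("!") + text.count("?")) / len(words)
    mean_len = sum(len(w) for w in words) / len(words)
    return [emphasis, mean_len / 10.0]


def text_feature(context: str, target: str) -> List[float]:
    """Toy stand-in for a semantic encoder of the context, conditioned on the
    text to be processed: Jaccard overlap of their word sets."""
    a, b = set(context.lower().split()), set(target.lower().split())
    union = a | b
    return [len(a & b) / len(union) if union else 0.0]


def spectrum_feature(target: str, prosody: List[float],
                     semantic: List[float], n_bins: int = 4) -> List[List[float]]:
    """Toy acoustic model: one frame per character, shifted by the
    associated prosodic feature and the associated text feature."""
    cond = sum(prosody) + sum(semantic)
    return [
        [(ord(ch) % 32) / 32.0 + cond * (k + 1) / n_bins for k in range(n_bins)]
        for ch in target
    ]


def vocoder(frames: List[List[float]], samples_per_frame: int = 8) -> List[float]:
    """Toy vocoder: each frame becomes a short sum of sinusoids."""
    wave = []
    for frame in frames:
        for n in range(samples_per_frame):
            sample = sum(
                amp * math.sin(2 * math.pi * (k + 1) * n / samples_per_frame)
                for k, amp in enumerate(frame)
            )
            wave.append(sample / len(frame))
    return wave


def generate_voice(sentences: Sequence[str], i: int) -> List[float]:
    target = sentences[i]                      # text to be processed
    context = associated_text(sentences, i)    # associated (context) text
    prosody = prosodic_feature(context)        # associated prosodic feature
    semantic = text_feature(context, target)   # associated text feature
    frames = spectrum_feature(target, prosody, semantic)  # spectrum feature
    return vocoder(frames)                     # target voice


sentences = ["Is it raining today?", "It is raining hard."]
waveform = generate_voice(sentences, 1)
```

Calling `generate_voice(sentences, 1)` synthesizes the second sentence while conditioning on the first. The point of the structure is that the spectrum feature depends on the context's prosody and semantics as well as on the target text itself.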