CPC G10L 13/08 (2013.01) [G06F 18/10 (2023.01); G06F 18/2135 (2023.01); G06F 18/217 (2023.01); G06N 3/02 (2013.01); G06N 3/042 (2023.01); G06N 3/08 (2013.01); G06N 5/02 (2013.01); G10L 13/04 (2013.01); G10L 19/00 (2013.01)] | 20 Claims |
1. A text-to-speech (TTS) system including one or more processors and one or more memories configured to perform operations for converting text into a corrected speech signal comprising:
training a neural network based upon, at least in part, data of previously generated speech in a pre-existing knowledgebase of phonemes, wherein the previously generated speech has an inaccuracy;
generating a lossy representation of at least a portion of the data for use in the training; and
applying lossy representation of at least the portion of the data to the previously generated speech for correcting the inaccuracy of the previously generated speech in the pre-existing knowledgebase of phonemes.
|