| CPC G10L 13/02 (2013.01) [G06F 3/16 (2013.01); G06N 3/084 (2013.01); G10L 15/08 (2013.01); G10L 25/30 (2013.01); G10L 2015/088 (2013.01)] | 20 Claims |

|
1. A computer-implemented method comprising:
receiving first audio data representing first speech corresponding to a first voice, the first speech including a first plurality of words;
processing the first audio data to determine first encoded data corresponding to phoneme characteristics of the first speech;
processing the first audio data to determine second encoded data corresponding to a phrase of the first speech;
determining third encoded data corresponding to vocal characteristics of a second voice different from the first voice; and
processing the first encoded data, the second encoded data, and the third encoded data to determine third audio data representing second speech corresponding to the second voice, the second speech including the first plurality of words.
|
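As an illustration only, the data flow recited in claim 1 can be sketched as below. The claim does not specify any encoder or decoder architecture, so every function name, the `FRAME` size, and the arithmetic inside each step are hypothetical placeholders chosen to make the flow runnable. They are not the patented implementation.

```python
from typing import List

FRAME = 4  # hypothetical frame length in samples (not from the claim)


def encode_phonemes(audio: List[float]) -> List[List[float]]:
    """Toy stand-in for the 'first encoded data': one small vector per frame."""
    frames = [audio[i:i + FRAME] for i in range(0, len(audio) - FRAME + 1, FRAME)]
    # Per-frame mean and range act as placeholder phoneme-level features.
    return [[sum(f) / FRAME, max(f) - min(f)] for f in frames]


def encode_phrase(audio: List[float]) -> List[float]:
    """Toy stand-in for the 'second encoded data': one vector for the whole phrase."""
    mean = sum(audio) / len(audio)
    energy = sum(x * x for x in audio) / len(audio)
    return [mean, energy]


def encode_voice(reference_audio: List[float]) -> List[float]:
    """Toy stand-in for the 'third encoded data': vocal characteristics of the second voice."""
    return [max(abs(x) for x in reference_audio)]


def decode(phonemes: List[List[float]], phrase: List[float], voice: List[float]) -> List[float]:
    """Combine the three encodings into output audio (placeholder arithmetic)."""
    out: List[float] = []
    for frame_mean, _frame_range in phonemes:
        # Scale frame-level content by the target-voice feature, offset by the phrase mean.
        value = voice[0] * frame_mean + phrase[0]
        out.extend([value] * FRAME)
    return out


def convert_voice(first_audio: List[float], second_voice_reference: List[float]) -> List[float]:
    """Follow the order of the claimed steps: two encodings of the input, one of the target voice."""
    first_encoded = encode_phonemes(first_audio)          # phoneme characteristics
    second_encoded = encode_phrase(first_audio)           # phrase-level encoding
    third_encoded = encode_voice(second_voice_reference)  # second voice
    return decode(first_encoded, second_encoded, third_encoded)


if __name__ == "__main__":
    source = [0.1, 0.2, 0.3, 0.4, 0.0, -0.1, -0.2, -0.3]
    target_reference = [0.5, -0.8, 0.2, 0.1]
    print(convert_voice(source, target_reference))
```

The point of the sketch is the structure: the same input audio feeds two separate encoders, the target voice is encoded independently, and a single decoding step consumes all three encodings. In a real system each placeholder would presumably be a trained model component.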