US 12,283,266 B2
Synthetic speech processing
Jaime Lorenzo Trueba, Cambridge (GB); Alejandro Ricardo Mottini d'Oliveira, Seattle, WA (US); Thomas Renaud Drugman, Carnieres (BE); and Sri Vishnu Kumar Karlapati, Cambridge (GB)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Apr. 24, 2023, as Appl. No. 18/305,456.
Application 18/305,456 is a continuation of application No. 17/007,709, filed on Aug. 31, 2020, granted, now 11,735,156.
Prior Publication US 2023/0260501 A1, Aug. 17, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 13/02 (2013.01); G06F 3/16 (2006.01); G06N 3/084 (2023.01); G10L 15/08 (2006.01); G10L 25/30 (2013.01)
CPC G10L 13/02 (2013.01) [G06F 3/16 (2013.01); G06N 3/084 (2013.01); G10L 15/08 (2013.01); G10L 25/30 (2013.01); G10L 2015/088 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method comprising:
receiving first audio data representing first speech corresponding to a first voice, the first speech including a first plurality of words;
processing the first audio data to determine first encoded data corresponding to phoneme characteristics of the first speech;
processing the first audio data to determine second encoded data corresponding to a phrase of the first speech;
determining third encoded data corresponding to vocal characteristics of a second voice different from the first voice; and
processing the first encoded data, the second encoded data, and the third encoded data to determine third audio data representing second speech corresponding to the second voice, the second speech including the first plurality of words.