US 11,056,096 B2
Artificial intelligence (AI)-based voice sampling apparatus and method for providing speech style in heterogeneous label
Jonghoon Chae, Seoul (KR)
Assigned to LG ELECTRONICS INC., Seoul (KR)
Filed by LG ELECTRONICS INC., Seoul (KR)
Filed on Sep. 10, 2019, as Appl. No. 16/566,265.
Claims priority of application No. 10-2019-0093560 (KR), filed on Jul. 31, 2019.
Prior Publication US 2020/0005764 A1, Jan. 2, 2020
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 13/00 (2006.01); G10L 13/10 (2013.01); G10L 13/033 (2013.01); G10L 13/047 (2013.01)
CPC G10L 13/10 (2013.01) [G10L 13/033 (2013.01); G10L 13/047 (2013.01)] 18 Claims
OG exemplary drawing
1. An artificial intelligence (AI)-based voice sampling apparatus for providing a speech style in a heterogeneous label, the apparatus comprising:
a rhyme encoder configured to receive a user's voice, extract a voice sample, and analyze a vocal feature included in the voice sample;
a text encoder configured to receive an input of text for reflecting the vocal feature;
a processor configured to:
classify the voice sample input to the rhythm encoder into a label according to the vocal feature,
provide a weight by measuring a distance between a voice sample corresponding to the label and a voice sample corresponding to a heterogeneous label as a label other than the label or provide a weight by measuring a similarity between the label and the heterogeneous label,
extract an embedding vector representing the vocal feature,
generate a speech style from the embedding vector, and
apply the generated speech style to the text; and
a rhyme decoder configured to output synthesized voice data in which the speech style is applied to the text by the processor.