CPC G10L 15/063 (2013.01) [G10L 15/02 (2013.01); G10L 15/22 (2013.01); G10L 15/30 (2013.01); G10L 2015/0638 (2013.01)]
20 Claims
1. An electronic device comprising:
a processor; and
a memory operatively connected to the processor,
wherein the memory stores instructions that, when executed, cause the processor to:
receive a voice input of a user,
extract a feature from the voice input of the user,
select an acoustic model through comparison with the extracted feature, and
perform fine-tuning on the selected acoustic model based on an utterance-induced value, so that the selected acoustic model learns the feature of the voice input, and
wherein the utterance-induced value includes a threshold value for determining a similarity between an utterance-induced text and an obtained text.
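The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation. The claim does not say how features are compared, how fine-tuning is carried out, or how the threshold is applied, so everything here is an assumption made for illustration: the `AcousticModel` class, the cosine comparison, the `difflib`-based text similarity, the default `threshold` and `lr` values, and the reading that fine-tuning proceeds only when the obtained text is sufficiently similar to the utterance-induced text.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List
import math


@dataclass
class AcousticModel:
    # Hypothetical stand-in: a real acoustic model has far richer parameters.
    name: str
    centroid: List[float]


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def select_model(feature: List[float], models: List[AcousticModel]) -> AcousticModel:
    """Select the model whose reference vector is closest to the extracted feature."""
    return max(models, key=lambda m: cosine(feature, m.centroid))


def text_similarity(induced_text: str, obtained_text: str) -> float:
    """Similarity in [0, 1] between the prompted text and the recognized text."""
    return SequenceMatcher(None, induced_text.lower(), obtained_text.lower()).ratio()


def maybe_fine_tune(model: AcousticModel, feature: List[float],
                    induced_text: str, obtained_text: str,
                    threshold: float = 0.8, lr: float = 0.5) -> bool:
    """Assumed policy: adapt the model only if the texts are similar enough."""
    if text_similarity(induced_text, obtained_text) < threshold:
        return False
    # Toy "fine-tuning": move the reference vector toward the user's feature.
    model.centroid = [c + lr * (f - c) for c, f in zip(model.centroid, feature)]
    return True
```

In this sketch, a caller would pass the feature extracted from the voice input to `select_model`, then call `maybe_fine_tune` with the text the user was prompted to say and the text actually recognized. A mismatch below the threshold leaves the selected model unchanged.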