CPC G10L 13/08 (2013.01) [G06N 3/045 (2023.01); G10L 19/032 (2013.01)]
13 Claims
1. An electronic device comprising:
a communication interface;
a memory configured to store a first neural network model; and
a processor connected to the communication interface and the memory,
wherein the processor is configured to:
receive, from an external electronic device via the communication interface, compressed information related to an acoustic feature obtained based on a text;
decompress the compressed information to obtain decompressed information; and
obtain sound information corresponding to the text by inputting the decompressed information into the first neural network model,
wherein the first neural network model is obtained by training a relationship between a first plurality of sample acoustic features and a first plurality of sample sounds corresponding to the first plurality of sample acoustic features,
wherein the first neural network model is trained to obtain the first plurality of sample sounds based on the first plurality of sample acoustic features and noise, and
wherein the first plurality of sample acoustic features are acoustic features that are distorted by compressing and decompressing a plurality of original acoustic features.
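
The compress/decompress round trip recited above, and its reuse to produce distorted training features, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the claim: the 8-bit uniform quantizer, the use of `zlib`, the log-mel value range `LO`/`HI`, the 80-bin frame size, and the function names are all hypothetical. The neural network model itself is not shown.

```python
import random
import zlib

# Hypothetical parameters: the claim does not specify a codec, a bit depth,
# or a value range for the acoustic features.
LO, HI = -12.0, 2.0   # assumed range of log-mel feature values
LEVELS = 255          # 8-bit uniform quantizer (codes 0..255)


def compress(frames):
    """Quantize each feature value to one byte, then deflate the byte stream."""
    n_bins = len(frames[0])
    codes = bytearray()
    for frame in frames:
        for v in frame:
            clipped = min(max(v, LO), HI)
            codes.append(round((clipped - LO) / (HI - LO) * LEVELS))
    # Prefix the bin count so the receiver can restore the frame shape.
    return n_bins.to_bytes(2, "big") + zlib.compress(bytes(codes))


def decompress(payload):
    """Invert compress(); values differ from the originals by quantization error."""
    n_bins = int.from_bytes(payload[:2], "big")
    codes = zlib.decompress(payload[2:])
    values = [LO + c / LEVELS * (HI - LO) for c in codes]
    return [values[i:i + n_bins] for i in range(0, len(values), n_bins)]


def distorted_training_features(original_frames):
    """Round-trip original features so training inputs carry the same
    distortion the receiving device will see after decompression."""
    return decompress(compress(original_frames))


random.seed(0)
original = [[random.uniform(LO, HI) for _ in range(80)] for _ in range(10)]
distorted = distorted_training_features(original)
max_err = max(
    abs(a - b)
    for f_orig, f_dist in zip(original, distorted)
    for a, b in zip(f_orig, f_dist)
)
print(len(distorted), len(distorted[0]), max_err)
```

In this sketch the same `decompress` function serves both roles: on the receiving device it prepares the model input, and during training it produces the distorted sample features, so the model is fitted to inputs that already contain the codec's quantization error.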