| CPC G10L 25/81 (2013.01) [G10L 25/30 (2013.01)] | 20 Claims |

|
11. An electronic device comprising:
an input interface;
a memory storing at least one instruction; and
at least one processor operatively connected with the input interface and the memory,
wherein the at least one processor is configured to execute the at least one instruction to:
obtain, from the input interface, a mixed sound source including at least one sound source,
obtain, based on the mixed sound source, scene information related to the mixed sound source
convert, based on the scene information, a first embedding vector corresponding to a designated sound source group into a second embedding vector, and
separate, based on the mixed sound source and the second embedding vector, the target sound source from the mixed sound source.
|