US 12,424,241 B2
Method for separating target sound source from mixed sound source and electronic device thereof
Jaemo Yang, Suwon-si (KR); Joonhyuk Chang, Suwon-si (KR); Geeyeun Kim, Suwon-si (KR); Hangil Moon, Suwon-si (KR); Kyoungho Bang, Suwon-si (KR); Dail Kim, Suwon-si (KR); Yungyeo Kim, Suwon-si (KR); Minsang Baek, Suwon-si (KR); Wongook Choi, Suwon-si (KR); and Jeonghwan Choi, Suwon-si (KR)
Assigned to SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR); and Industry-University Cooperation Foundation Hanyang University, Seoul (KR)
Filed by SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR); and INDUSTRY-UNIVERSITY COOPERATION FOUNDATION HANYANG UNIVERSITY, Seoul (KR)
Filed on Aug. 18, 2023, as Appl. No. 18/235,664.
Application 18/235,664 is a continuation of application No. PCT/KR2023/010971, filed on Jul. 27, 2023.
Claims priority of application No. 10-2022-0103538 (KR), filed on Aug. 18, 2022; and application No. 10-2022-0125096 (KR), filed on Sep. 30, 2022.
Prior Publication US 2024/0062773 A1, Feb. 22, 2024
Int. Cl. G10L 25/81 (2013.01); G10L 25/30 (2013.01)
CPC G10L 25/81 (2013.01) [G10L 25/30 (2013.01)] 20 Claims
OG exemplary drawing
 
11. An electronic device comprising:
an input interface;
a memory storing at least one instruction; and
at least one processor operatively connected with the input interface and the memory,
wherein the at least one processor is configured to execute the at least one instruction to:
obtain, from the input interface, a mixed sound source including at least one sound source,
obtain, based on the mixed sound source, scene information related to the mixed sound source
convert, based on the scene information, a first embedding vector corresponding to a designated sound source group into a second embedding vector, and
separate, based on the mixed sound source and the second embedding vector, the target sound source from the mixed sound source.