US 11,862,141 B2
Signal processing device and signal processing method
Naoya Takahashi, Tokyo (JP)
Assigned to SONY GROUP CORPORATION, Tokyo (JP)
Appl. No. 17/593,361
Filed by SONY GROUP CORPORATION, Tokyo (JP)
PCT Filed Mar. 13, 2020, PCT No. PCT/JP2020/011008
§ 371(c)(1), (2) Date Sep. 16, 2021,
PCT Pub. No. WO2020/195924, PCT Pub. Date Oct. 1, 2020.
Claims priority of application No. 2019-059819 (JP), filed on Mar. 27, 2019.
Prior Publication US 2022/0189496 A1, Jun. 16, 2022
Int. Cl. G10L 21/028 (2013.01); G10L 21/0208 (2013.01); G10L 25/84 (2013.01)
CPC G10L 21/028 (2013.01) [G10L 21/0208 (2013.01); G10L 25/84 (2013.01); G10L 2021/02087 (2013.01)] 16 Claims
OG exemplary drawing
 
1. A signal processing device, comprising:
a central processing unit (CPU) configured to:
receive an input acoustic signal associated with a plurality of sound sources;
execute, based on a sound source separation model learned in advance to separate a sound source from an acoustic signal, a sound source separation on the received input acoustic signal;
generate, based on the sound source separation, a plurality of separated signals;
determine that the sound source separation on a separated signal of the plurality of separation signals is to be ended based on an end condition,
wherein the end condition is a condition that an average energy level of the separated signal of the plurality of separated signals is equal to or less than a specific threshold value;
execute the sound source separation on the plurality of separated signals that does not satisfy the end condition; and
recursively execute the sound source separation, on the plurality of separated signals that does not satisfy the end condition, until each of the plurality of separated signals satisfies the end condition,
wherein the sound source separation model is trained as a N-source model, and
a number of the plurality of sound sources is greater than N.