US 12,469,512 B2
	Echo suppression device, echo suppression method, and echo suppression program
Yuki Satomi, Yokohama (JP)
Assigned to TRANSTRON INC., Yokohama (JP)
Appl. No. 17/801,955
Filed by TRANSTRON INC., Yokohama (JP)
PCT Filed Apr. 7, 2021, PCT No. PCT/JP2021/014808 § 371(c)(1), (2) Date Aug. 24, 2022, PCT Pub. No. WO2021/210473, PCT Pub. Date Oct. 21, 2021.
Claims priority of application No. 2020-071463 (JP), filed on Apr. 13, 2020.
Prior Publication US 2023/0079749 A1, Mar. 16, 2023
Int. Cl. G10L 21/0232 (2013.01); G10L 21/0208 (2013.01); G10L 25/18 (2013.01); G10L 25/21 (2013.01)

CPC G10L 21/0232 (2013.01) [G10L 25/18 (2013.01); G10L 25/21 (2013.01); G10L 2021/02082 (2013.01)]

11 Claims

1. An echo suppression device configured to be provided in a transmitting signal path that transmits an input signal input from a microphone, in a near-end terminal including a speaker and the microphone, the echo suppression device, comprising

a mask generation unit that

generates a base mask based on a learning signal transmitted through the transmitting signal path when a speech is not input to the microphone and a sound is output from the speaker, and

generates a plurality of masks by changing a magnitude of the learning signal, the plurality of masks including the base mask;

a mask storage unit that stores the plurality of masks;

a mask selection unit that sequentially selects an optimal mask from among the plurality of masks, the optimal mask selected for each sample point in time that a reception signal is acquired, the reception signal transmitted through a receiving signal path that transmits a signal to the speaker, the optimal mask selected according to;

a magnitude of a reception signal and

the base mask generated for the reception signal acquired within a predetermined period before the sample point in time;

a double-talk detection unit that sequentially detects whether a speech is input to the microphone at the sample point in time that the reception signal is acquired based on a result of comparing the input signal with the optimal mask selected for the sample point in time; and

an echo suppressor that sequentially performs a process of suppressing an echo on the input signal in response to the double-talk detection unit detecting that a speech is not input to the microphone and the reception signal includes a speech.