US 12,407,783 B2
Double-microphone array echo eliminating method, device and electronic equipment
Minghui Song, Shenzhen (CN); and Jinhong Lin, Shenzhen (CN)
Assigned to Shenzhen Bluetrum Technology Co., Ltd., Shenzhen (CN)
Filed by Shenzhen Bluetrum Technology Co., Ltd., Shenzhen (CN)
Filed on Nov. 10, 2022, as Appl. No. 18/054,153.
Claims priority of application No. 202210470223.9 (CN), filed on Apr. 28, 2022.
Prior Publication US 2023/0353683 A1, Nov. 2, 2023
Int. Cl. H04M 9/08 (2006.01); G10L 21/0208 (2013.01); G10L 21/0216 (2013.01); G10L 21/0232 (2013.01); G10L 21/034 (2013.01); G10L 25/18 (2013.01)
CPC H04M 9/082 (2013.01) [G10L 21/0232 (2013.01); G10L 21/034 (2013.01); G10L 25/18 (2013.01); G10L 2021/02082 (2013.01); G10L 2021/02165 (2013.01)] 8 Claims
OG exemplary drawing
 
1. A dual-microphone array echo cancellation method, the method comprising:
acquiring far-end signals generated by an electronic equipment during a phone conversation, and a first near-end signal and a second near-end signal of a dual microphone array;
performing Fourier transform on the far-end signals to obtain far-end frequency domain information, performing Fourier transform on the first near-end signal to obtain first near-end frequency domain information, and performing Fourier transform on the second near-end signal to obtain second near-end frequency domain information;
performing filtering processing on the far-end frequency domain information by using the updated filter weight coefficient of the previous frame to obtain an echo frequency spectrum;
subtracting the echo frequency spectrum from the first near-end frequency domain information to obtain a first initial error frequency spectrum;
subtracting the echo frequency spectrum from the second near-end frequency domain information to obtain a second initial error frequency spectrum;
performing centroid fitting on the first initial error frequency spectrum and the second initial error frequency spectrum respectively to obtain initial suppression gain factors of the first near-end signal and the second near-end signal, which comprises calculating a first cross-correlation coefficient between the far-end frequency domain information and the first initial error frequency spectrum, and a second cross-correlation coefficient between the far-end frequency domain information and the second initial error frequency spectrum, performing centroid fitting on the first cross-correlation coefficient to obtain a first initial suppression gain factor, and performing centroid fitting on the second cross-correlation coefficient to obtain a second initial suppression gain factor, multiplying the first initial suppression gain factor by the first initial error frequency spectrum to obtain the first error frequency spectrum, and multiplying the second initial suppression gain factor by the second initial error frequency spectrum to obtain the second error frequency spectrum, and using the initial suppression gain factors as variable step-size factors for linear filtering processing of the next frame;
performing residual echo suppression on the first error frequency spectrum and the second error frequency spectrum by using an adaptive zero-pole echo canceller to obtain a target frequency spectrum signal;
after performing sub-band range selection on the initial suppression gain factors, obtaining a smooth factor, and performing full-band smoothing and exponential operation to obtain a secondary suppression gain factor;
performing filtering processing on the target frequency spectrum signal by using the secondary suppression gain factor to obtain a target near-end voice signal.