US 12,456,477 B2
Audio source separation for multi-channel beamforming based on face detection
Saeed Mosayyebpour Kaskari, Irvine, CA (US)
Assigned to Synaptics Incorporated, San Jose, CA (US)
Filed by Synaptics Incorporated, San Jose, CA (US)
Filed on Apr. 19, 2023, as Appl. No. 18/303,432.
Prior Publication US 2024/0355349 A1, Oct. 24, 2024
Int. Cl. G10L 21/0272 (2013.01); G06V 40/16 (2022.01); G10L 21/0216 (2013.01); H04R 3/00 (2006.01); H04R 5/027 (2006.01); H04S 3/00 (2006.01)
CPC G10L 21/0272 (2013.01) [G06V 40/161 (2022.01); G10L 21/0216 (2013.01); H04R 3/005 (2013.01); H04R 5/027 (2013.01); H04S 3/008 (2013.01); G10L 2021/02166 (2013.01); H04S 2400/01 (2013.01); H04S 2400/15 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method of processing audio signals, comprising:
receiving an audio signal via a plurality of microphones;
receiving an image associated with a frame of the audio signal;
detecting one or more faces in the received image;
selecting a number (N) of target faces among the one or more faces detected in the received image;
determining a respective direction of each of the N target faces relative to the plurality of microphones; and
selectively steering a beam associated with a multi-channel beamformer toward a direction-of-arrival (DOA) of the audio signal based at least in part on the directions of the N target faces.