| CPC G10L 21/0272 (2013.01) [G06V 40/161 (2022.01); G10L 21/0216 (2013.01); H04R 3/005 (2013.01); H04R 5/027 (2013.01); H04S 3/008 (2013.01); G10L 2021/02166 (2013.01); H04S 2400/01 (2013.01); H04S 2400/15 (2013.01)] | 20 Claims |

|
1. A method of processing audio signals, comprising:
receiving an audio signal via a plurality of microphones;
receiving an image associated with a frame of the audio signal;
detecting one or more faces in the received image;
selecting a number (N) of target faces among the one or more faces detected in the received image;
determining a respective direction of each of the N target faces relative to the plurality of microphones; and
selectively steering a beam associated with a multi-channel beamformer toward a direction-of-arrival (DOA) of the audio signal based at least in part on the directions of the N target faces.
|
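The steps recited in claim 1 can be illustrated with a minimal sketch. This is one possible reading for illustration only, not the claimed implementation. All names here (`Face`, `face_azimuth_deg`, `select_target_faces`, `choose_steering_direction`) are hypothetical, as are the assumptions that the camera is co-located with the microphone array, that it follows a pinhole model with a known horizontal field of view, that the N target faces are chosen by bounding-box size, and that "selectively steering" means steering only when the DOA lies within a tolerance of some target face direction. Face detection and DOA estimation are taken as given inputs.

```python
import math
from dataclasses import dataclass


@dataclass
class Face:
    x_center: float  # horizontal pixel coordinate of the face-box center
    width: float     # face-box width in pixels


def face_azimuth_deg(face, image_width, horizontal_fov_deg):
    """Map a face's pixel position to an azimuth angle (degrees).

    Assumes a pinhole camera whose optical axis points along 0 degrees
    and which sits at the microphone array (an illustrative assumption).
    """
    half_width = image_width / 2.0
    focal_px = half_width / math.tan(math.radians(horizontal_fov_deg / 2.0))
    return math.degrees(math.atan2(face.x_center - half_width, focal_px))


def select_target_faces(faces, n):
    """Pick N target faces; the largest-box criterion is an assumption."""
    return sorted(faces, key=lambda f: f.width, reverse=True)[:n]


def choose_steering_direction(doa_deg, target_dirs_deg, tolerance_deg=10.0):
    """Return the DOA to steer toward, or None to leave the beam unsteered.

    The beam is steered only if the DOA is within tolerance_deg of at
    least one target face direction (one reading of "selectively").
    """
    if not target_dirs_deg:
        return None
    nearest = min(target_dirs_deg, key=lambda d: abs(d - doa_deg))
    return doa_deg if abs(nearest - doa_deg) <= tolerance_deg else None


# Example: three detected faces in a 640-pixel-wide image, N = 2.
faces = [Face(100, 50), Face(320, 120), Face(500, 80)]
targets = select_target_faces(faces, 2)
directions = [face_azimuth_deg(f, 640, 90.0) for f in targets]
steer_to = choose_steering_direction(5.0, directions)
```

In this sketch a DOA estimate that does not line up with any selected face yields `None`, so a caller would leave the beamformer unchanged for that frame. The tolerance value and the selection criterion are placeholders that a real system would choose differently.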