CPC G10L 21/0208 (2013.01) [G10L 17/00 (2013.01); G10L 21/0272 (2013.01); G10L 25/57 (2013.01); G10L 2021/02087 (2013.01)] | 20 Claims |
1. A method comprising:
receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device;
in response to receiving the first indication, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of each of the one or more first speakers in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, wherein sending the isolated speech signals for each of the one or more first speakers to the listening device comprises, for each first speaker of the one or more first speakers:
identifying a respective location of the first speaker relative to a location of the listening device that is configured to receive audio input from a plurality of audio channels; and
sending an isolated speech signal to a respective audio channel of the plurality of audio channels in accordance with the respective location of the first speaker corresponding to the isolated speech signal;
while generating the respective isolated speech signal for each of the one or more first speakers, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device; and
in response to the second indication, generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.
|
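The location-based channel routing recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation, and every name in it is hypothetical. It assumes each speaker's location is reduced to a single azimuth angle in degrees relative to the listening device (negative values to the left), that the listening device exposes two channels named "left" and "right", and that signals sharing a channel are mixed by sample-wise addition. The claim specifies none of these details.

```python
from dataclasses import dataclass
from typing import Dict, List, Sequence


@dataclass
class IsolatedSpeech:
    speaker_id: str       # hypothetical identifier for an indicated speaker
    azimuth_deg: float    # assumed angle relative to the listening device; negative = left
    samples: List[float]  # isolated speech samples for this speaker


def channel_for(azimuth_deg: float, channels: Sequence[str] = ("left", "right")) -> str:
    """Pick a channel by splitting the range [-90, 90] degrees into equal sectors."""
    clamped = max(-90.0, min(90.0, azimuth_deg))
    sector_width = 180.0 / len(channels)
    index = min(int((clamped + 90.0) / sector_width), len(channels) - 1)
    return channels[index]


def route(signals: Sequence[IsolatedSpeech],
          channels: Sequence[str] = ("left", "right")) -> Dict[str, List[float]]:
    """Send each isolated signal to the channel matching its speaker's location."""
    out: Dict[str, List[float]] = {name: [] for name in channels}
    for sig in signals:
        target = out[channel_for(sig.azimuth_deg, channels)]
        # Pad the channel buffer with silence so it can hold this signal.
        if len(target) < len(sig.samples):
            target.extend([0.0] * (len(sig.samples) - len(target)))
        # Assumed mixing rule: add samples that share a channel.
        for i, sample in enumerate(sig.samples):
            target[i] += sample
    return out


first_speakers = [
    IsolatedSpeech("speaker_a", -30.0, [0.1, 0.2]),
    IsolatedSpeech("speaker_b", 45.0, [0.5]),
]
mixed = route(first_speakers)
```

Under these assumptions, when a second indication arrives, the same `route` function could be called again on the isolated signals of the newly indicated speakers.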