| CPC H04S 7/303 (2013.01) [G10L 25/78 (2013.01); H04S 3/008 (2013.01); G10L 2025/783 (2013.01); H04S 2400/11 (2013.01); H04S 2400/13 (2013.01)] | 20 Claims |

|
15. A system comprising:
a processor; and
a memory coupled to the processor, with instructions stored thereon that, when executed by the processor, cause the processor to perform operations comprising:
receiving, from a server, encoded audio that includes a first audio stream and a first voice-activity detection (VAD) signal for the first audio stream, and a second audio stream, wherein the first audio stream and the second audio stream in the encoded audio are not separable;
determining that a first user associated with the first audio stream is blocked by a second user;
determining that the first VAD signal indicates that the first audio stream includes speech;
generating additional audio;
mixing the additional audio with the encoded audio; and
providing the mixed audio to a speaker for output to the second user.
|