| CPC G10L 25/57 (2013.01) [G06V 20/41 (2022.01); G10L 21/043 (2013.01); G10L 25/63 (2013.01); H04N 21/4542 (2013.01)] | 20 Claims |

|
1. A computer-implemented method comprising:
receiving, from a sender device, an audio stream that is directed to a receiver device, wherein the sender device and the receiver device participate in a virtual metaverse;
providing, as input to a trained machine-learning model, the audio stream and a speech analysis score for a first user associated with the sender device;
generating as output, by the trained machine-learning model, a level of toxicity in the audio stream;
identifying silence or a pause between words in the audio stream, the silence or the pause corresponding to a particular timestamp in the audio stream; and
transmitting the audio stream to the receiver device, wherein the transmitting is performed to introduce a time delay in the audio stream based on the level of toxicity and wherein the time delay is introduced as a gap in the audio stream at the particular timestamp of the silence or the pause between words.
|