US 12,283,288 B2
	Artificial latency for moderating voice communication
Mahesh Kumar Nandwana, Sunnyvale, CA (US); Philippe Clavel, Belmont, CA (US); and Morgan McGuire, Vancouver (CA)
Assigned to Roblox Corporation, San Mateo, CA (US)
Filed by Roblox Corporation, San Mateo, CA (US)
Filed on May 21, 2024, as Appl. No. 18/670,422.
Application 18/670,422 is a continuation of application No. 17/940,749, filed on Sep. 8, 2022, granted, now 12,027,177.
Prior Publication US 2024/0304210 A1, Sep. 12, 2024
Int. Cl. G10L 25/57 (2013.01); G06V 20/40 (2022.01); G10L 21/043 (2013.01); G10L 25/63 (2013.01); H04N 21/454 (2011.01)

CPC G10L 25/57 (2013.01) [G06V 20/41 (2022.01); G10L 21/043 (2013.01); G10L 25/63 (2013.01); H04N 21/4542 (2013.01)]

20 Claims

1. A computer-implemented method comprising:

receiving, from a sender device, an audio stream that is directed to a receiver device, wherein the sender device and the receiver device participate in a virtual metaverse;

providing, as input to a trained machine-learning model, the audio stream and a speech analysis score for a first user associated with the sender device;

generating as output, by the trained machine-learning model, a level of toxicity in the audio stream;

identifying silence or a pause between words in the audio stream, the silence or the pause corresponding to a particular timestamp in the audio stream; and

transmitting the audio stream to the receiver device, wherein the transmitting is performed to introduce a time delay in the audio stream based on the level of toxicity and wherein the time delay is introduced as a gap in the audio stream at the particular timestamp of the silence or the pause between words.