US 12,494,204 B2
	Systems and methods for recognizing a speech of a speaker
Ilya Vladimirovich Mikhailov, Saint Petersburg (RU)
Assigned to RingCentral, Inc., Belmont, CA (US)
Filed by RingCentral, Inc., Belmont, CA (US)
Filed on Dec. 5, 2023, as Appl. No. 18/528,981.
Application 18/528,981 is a continuation of application No. 17/813,367, filed on Jul. 19, 2022, granted, now 12,002,464.
Application 17/813,367 is a continuation of application No. 16/583,688, filed on Sep. 26, 2019, granted, now 11,423,889, issued on Aug. 23, 2022.
Application 16/583,688 is a continuation of application No. PCT/RU2018/000906, filed on Dec. 28, 2018.
Prior Publication US 2024/0119934 A1, Apr. 11, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 15/22 (2006.01); G06N 3/08 (2023.01); G10L 13/00 (2006.01); G10L 15/04 (2013.01); G10L 15/16 (2006.01); G10L 15/24 (2013.01); G10L 15/30 (2013.01); G10L 25/90 (2013.01)

CPC G10L 15/22 (2013.01) [G06N 3/08 (2013.01); G10L 13/00 (2013.01); G10L 15/04 (2013.01); G10L 15/16 (2013.01); G10L 15/24 (2013.01); G10L 15/30 (2013.01); G10L 25/90 (2013.01); G10L 2015/223 (2013.01)]

20 Claims

1. A computer-implemented method applied to a computing device for recognizing a command within an audio signal, the method comprising:

identifying, by a processor, a speaker from the audio signal;

generating a speech spectrogram of the audio signal;

separating, using a computer-based machine learning model, the audio signal to obtain a segment of the audio signal containing a speech associated with the identified speaker based on the speech spectrogram of the audio signal and an audio profile associated with the identified speaker;

determining, from the segment of the audio signal, the command from the speech associated with the identified speaker;

determining, from the segment of the audio signal, a score reflecting a value of confidence that the command determined is a valid command; and

in response to determining that the score is above a first threshold value, causing the command to be executed at the computing device.