| CPC G10L 15/16 (2013.01) [G10L 15/065 (2013.01)] | 20 Claims |

|
1. A processor comprising:
one or more circuits to perform automatic speech recognition (ASR) using one or more ASR machine learning models (MLMs), the one or more ASR MLMs trained, at least, by:
generating, using one or more ASR MLMs and first textual data, one or more spectrograms;
generating, using the one or more ASR MLMs and the one or more spectrograms, output data indicating second textual data; and
updating one or more parameters of the one or more ASR MLMs based at least on the output data and ground truth data associated with the first textual data.
|