| CPC G10L 15/16 (2013.01) [G06N 3/044 (2023.01); G06N 3/08 (2013.01)] | 20 Claims |

|
1. A method performed by one or more computers and for training a speech recognition neural network system comprising one or more neural networks, the method comprising:
pre-training one or more of the neural networks in the speech recognition neural network system on a corpus comprising text data, wherein pre-training the one or more of the neural networks in the speech recognition neural network system on a corpus comprising text data comprises:
pre-training one or more of the neural networks in the speech recognition neural network system on the corpus comprising text data to perform next-step prediction; and
after the pre-training, training the one or more neural networks in the speech recognition neural network system on training data that maps audio to text transcriptions.
|
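
The two stages recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the class `ToySpeechRecognizer`, the four-token vocabulary, the one-hot "audio" frames, and the one-frame-per-character alignment are all hypothetical simplifications chosen so the example runs with the Python standard library alone. Stage 1 trains only the text-side parameters to predict the next token; stage 2 continues training the same system on pairs of audio frames and transcriptions.

```python
import math
import random

VOCAB = ["<s>", "a", "b", "c"]          # toy vocabulary (assumption)
V = len(VOCAB)
IDX = {tok: i for i, tok in enumerate(VOCAB)}


def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


class ToySpeechRecognizer:
    """Hypothetical system: a text-side predictor plus an acoustic layer."""

    def __init__(self, feat_dim, seed=0):
        rng = random.Random(seed)
        # lm[prev][nxt]: logit of the next token given the previous token.
        self.lm = [[0.0] * V for _ in range(V)]
        # acoustic[nxt][d]: linear map from one audio frame to token logits.
        self.acoustic = [[rng.uniform(-0.1, 0.1) for _ in range(feat_dim)]
                         for _ in range(V)]

    def logits(self, prev, frame=None):
        out = list(self.lm[prev])
        if frame is not None:
            for k in range(V):
                out[k] += sum(w * x for w, x in zip(self.acoustic[k], frame))
        return out

    def step(self, prev, target, frame=None, lr=0.5):
        """One SGD step on cross-entropy; returns the loss before the update."""
        probs = softmax(self.logits(prev, frame))
        for k in range(V):
            grad = probs[k] - (1.0 if k == target else 0.0)
            self.lm[prev][k] -= lr * grad
            if frame is not None:
                for d, x in enumerate(frame):
                    self.acoustic[k][d] -= lr * grad * x
        return -math.log(probs[target])


def pretrain_on_text(model, corpus, epochs=50):
    """Stage 1: next-step prediction on text only; no audio is involved."""
    for _ in range(epochs):
        for sentence in corpus:
            prev = IDX["<s>"]
            for ch in sentence:
                model.step(prev, IDX[ch])
                prev = IDX[ch]


def train_on_audio(model, pairs, epochs=50):
    """Stage 2: train on (audio frames, transcription) pairs."""
    epoch_losses = []
    for _ in range(epochs):
        total = 0.0
        for frames, transcript in pairs:
            prev = IDX["<s>"]
            for frame, ch in zip(frames, transcript):
                total += model.step(prev, IDX[ch], frame)
                prev = IDX[ch]
        epoch_losses.append(total)
    return epoch_losses


# Toy data (assumptions): a text-only corpus and a smaller paired set.
TEXT_CORPUS = ["abc", "abca", "bca"]
FRAME = {"a": [1.0, 0.0, 0.0], "b": [0.0, 1.0, 0.0], "c": [0.0, 0.0, 1.0]}
AUDIO_PAIRS = [([FRAME[ch] for ch in t], t) for t in ["abc", "cab"]]

model = ToySpeechRecognizer(feat_dim=3)
pretrain_on_text(model, TEXT_CORPUS)

# After stage 1 the model can already rank next tokens from text alone.
scores = model.logits(IDX["a"])
text_only_guess = VOCAB[scores.index(max(scores))]

losses = train_on_audio(model, AUDIO_PAIRS)

# After stage 2 the audio frame influences the prediction.
scores = model.logits(IDX["<s>"], FRAME["c"])
first_token = VOCAB[scores.index(max(scores))]
```

In this sketch the text-side table `lm` is updated in both stages, which mirrors the claim's wording that the pre-trained networks are subsequently trained on the audio-to-text data. A real system would use far larger networks and would not assume that frames and characters are aligned one to one.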