| CPC G10L 15/16 (2013.01) [G06N 3/088 (2013.01); G10L 15/142 (2013.01); G10L 25/51 (2013.01)] | 20 Claims |

|
11. A device comprising:
a network interface;
a memory; and
one or more processors coupled to the network interface and the memory, and configured to:
receive, from respective client devices, local acoustic model weights for a neural network-based acoustic model of a local automatic speech recognition system running on the respective client devices, wherein the local acoustic model weights are generated at the respective client devices without labelled data;
update a global automatic speech recognition system based on (a) the local acoustic model weights received from the respective client devices and (b) global acoustic model weights of the global automatic speech recognition system derived from labelled data to obtain an updated global automatic speech recognition system, including controlling an influence of unsupervised data on the updated global automatic speech recognition system by multiplying a first balancing coefficient with the local acoustic model weights and multiplying a second balancing coefficient with the global acoustic model weights, and combining resulting multiplication products thereof, wherein the first balancing coefficient is equal to one minus the second balancing coefficient, and wherein the second balancing coefficient has a value of zero to one; and
send the updated global automatic speech recognition system to the respective client devices to operate as a new local automatic speech recognition system.
|