US 12,217,741 B2
	Large scale privacy-preserving speech recognition system using federated learning
Sylvain Le Groux, Placerville, CA (US); and Erwan Barry Tarik Zerhouni, Zürich (CH)
Assigned to CISCO TECHNOLOGY, INC., San Jose, CA (US)
Filed by Cisco Technology, Inc., San Jose, CA (US)
Filed on May 19, 2021, as Appl. No. 17/324,535.
Prior Publication US 2022/0383857 A1, Dec. 1, 2022
Int. Cl. G10L 15/16 (2006.01); G06N 3/088 (2023.01); G10L 15/14 (2006.01); G10L 25/51 (2013.01)

CPC G10L 15/16 (2013.01) [G06N 3/088 (2013.01); G10L 15/142 (2013.01); G10L 25/51 (2013.01)]

20 Claims

11. A device comprising:

a network interface;

a memory; and

one or more processors coupled to the network interface and the memory, and configured to:

receive, from respective client devices, local acoustic model weights for a neural network-based acoustic model of a local automatic speech recognition system running on the respective client devices, wherein the local acoustic model weights are generated at the respective client devices without labelled data;

update a global automatic speech recognition system based on (a) the local acoustic model weights received from the respective client devices and (b) global acoustic model weights of the global automatic speech recognition system derived from labelled data to obtain an updated global automatic speech recognition system, including controlling an influence of unsupervised data on the updated global automatic speech recognition system by multiplying a first balancing coefficient with the local acoustic model weights and multiplying a second balancing coefficient with the global acoustic model weights, and combining resulting multiplication products thereof, wherein the first balancing coefficient is equal to one minus the second balancing coefficient, and wherein the second balancing coefficient has a value of zero to one; and

send the updated global automatic speech recognition system to the respective client devices to operate as a new local automatic speech recognition system.