US 12,242,643 B2
	System and method for secure transcription generation
William F. Ganong, III, Brookline, MA (US); and Uwe Helmut Jost, Groton, MA (US)
Assigned to Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on Jun. 3, 2022, as Appl. No. 17/832,259.
Prior Publication US 2023/0394169 A1, Dec. 7, 2023
Int. Cl. G06F 21/62 (2013.01); G06F 21/84 (2013.01); G06F 40/166 (2020.01); G06F 40/295 (2020.01); G10L 13/02 (2013.01); G10L 15/06 (2013.01); G10L 15/08 (2006.01); G10L 15/22 (2006.01)

CPC G06F 21/6245 (2013.01) [G06F 21/84 (2013.01); G06F 40/166 (2020.01); G06F 40/295 (2020.01); G10L 13/02 (2013.01); G10L 15/063 (2013.01); G10L 15/08 (2013.01); G10L 15/22 (2013.01); G10L 2015/088 (2013.01)]

17 Claims

1. A computer-implemented method, executed on a computing device, comprising:

receiving an input speech signal;

receiving a transcription of the input speech signal;

identifying one or more sensitive content portions from the transcription of the input speech signal;

obscuring the one or more sensitive content portions from the transcription of the input speech signal, thus defining an obscured transcription of the input speech signal; and

generating an obscured speech signal based upon, at least in part, the input speech signal and the obscured transcription of the input speech signal, wherein generating an obscured speech signal based upon, at least in part, the input speech signal and the obscured transcription of the input speech signal includes:

comparing portions of the transcription of the input speech signal to corresponding portions of the obscured transcription of the input speech signal;

in response to determining that the portions of the transcription of the input speech signal are different from the corresponding portions of the obscured transcription of the input speech signal, synthesizing the corresponding portions of the obscured transcription of the input speech signal to generate the obscured speech signal; and

in response to determining that the portions of the transcription of the input speech signal are the same as the corresponding portions of the obscured transcription of the input speech signal, using a corresponding portion of the input speech signal to generate the obscured speech signal.