| CPC G06F 40/166 (2020.01) [G06F 18/214 (2023.01); G06F 21/6245 (2013.01); G06F 40/279 (2020.01); G06F 40/30 (2020.01); G06N 20/00 (2019.01)] | 20 Claims |

|
1. A computer-implemented method comprising:
receiving, by a transcription service in a provider network, via a first application programming interface offered by the transcription service, a first request to generate a redacted transcript of content, the transcription service implemented by one or more electronic devices;
obtaining, by the transcription service, a transcript of the content;
sending, by the transcription service, the transcript to a model endpoint hosted by a machine learning service in the provider network, wherein the transcript is received at the model endpoint, wherein the machine learning service is implemented by one or more electronic devices;
receiving, by the transcription service, an inference response identifying one or more sensitive entities in the transcript;
generating, by the transcription service, the redacted transcript based at least on the transcript and the inference response, wherein the redacted transcript comprises a respective sensitive entity tag in place of each sensitive entity of the one or more sensitive entities, wherein the respective sensitive entity tag comprises a name indicating a sensitive entity data type of the sensitive entity;
receiving, by the transcription service, via a second application programming interface offered by the transcription service, a second request to obtain all sensitive entities of a particular sensitive entity data type, wherein the second request comprises a name of the particular sensitive entity data type;
querying, by the transcription service, a representation of the one or more sensitive entities based on the name of the particular sensitive entity data type; and
returning, by the transcription service, via the second application programming interface, all sensitive entities, of the one or more sensitive entities, that are of the particular sensitive entity data type.
|
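The redaction and query steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the claimed implementation. It assumes the inference response reports each sensitive entity as a character span plus a data type name, that spans do not overlap, and that a tag is written as the type name in square brackets. The names `SensitiveEntity`, `redact`, `index_by_type`, and `entities_of_type` are hypothetical and do not come from the claim. The calls to the transcription service, the model endpoint, and the two application programming interfaces are omitted.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class SensitiveEntity:
    """One entity from the (assumed) inference response format."""
    start: int        # inclusive character offset in the transcript
    end: int          # exclusive character offset in the transcript
    entity_type: str  # name of the sensitive entity data type, e.g. "NAME"


def redact(transcript: str, entities: list[SensitiveEntity]) -> str:
    """Replace each entity span with a tag naming its data type.

    Assumes spans do not overlap; the "[TYPE]" tag format is illustrative.
    """
    parts = []
    cursor = 0
    for entity in sorted(entities, key=lambda e: e.start):
        parts.append(transcript[cursor:entity.start])  # text before the entity
        parts.append(f"[{entity.entity_type}]")        # tag in place of entity
        cursor = entity.end
    parts.append(transcript[cursor:])                  # text after last entity
    return "".join(parts)


def index_by_type(transcript: str,
                  entities: list[SensitiveEntity]) -> dict[str, list[str]]:
    """Build a queryable representation: data type name -> entity texts."""
    index = defaultdict(list)
    for entity in sorted(entities, key=lambda e: e.start):
        index[entity.entity_type].append(transcript[entity.start:entity.end])
    return dict(index)


def entities_of_type(index: dict[str, list[str]], type_name: str) -> list[str]:
    """Return all entities of the named data type (empty list if none)."""
    return list(index.get(type_name, []))


if __name__ == "__main__":
    transcript = "Call Jane Doe at 555-0100."
    entities = [
        SensitiveEntity(5, 13, "NAME"),
        SensitiveEntity(17, 25, "PHONE"),
    ]
    print(redact(transcript, entities))
    print(entities_of_type(index_by_type(transcript, entities), "PHONE"))
```

The type-keyed index is built once when the transcript is redacted, so a later request for one data type becomes a single dictionary lookup by name and does not rescan the transcript.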