CPC G10L 15/01 (2013.01) [G06F 16/35 (2019.01); G06N 5/022 (2013.01); G10L 15/02 (2013.01); G10L 15/10 (2013.01); G10L 15/187 (2013.01); G10L 15/22 (2013.01); G10L 2015/025 (2013.01)]
20 Claims
1. A system for correcting automatic speech recognition (ASR) errors comprising:
one or more processors; and
a memory in communication with the one or more processors and storing instructions that, when executed by the one or more processors, are configured to cause the system to:
receive, via an ASR model, a transcription comprising one or more transcribed words;
retrieve one or more respective predefined confidence levels associated with the one or more transcribed words;
determine whether the one or more transcribed words exceed the one or more respective predefined confidence levels;
responsive to determining that a first transcribed word of the one or more transcribed words does not exceed a first respective predefined confidence level, generate, using a first machine learning model, a first predicted word;
generate, using the first machine learning model, a first numerical representation of the first transcribed word and a second numerical representation of the first predicted word;
calculate a distance between the first and second numerical representations;
determine whether the distance exceeds a predefined threshold; and
responsive to determining that the distance exceeds the predefined threshold:
retrieve a first list comprising a plurality of predefined red flag words;
determine whether at least one of the plurality of predefined red flag words corresponds to a context of a grouping of transcribed words surrounding the first transcribed word by iteratively substituting each predefined red flag word of the plurality of predefined red flag words for the first transcribed word; and
responsive to determining the at least one of the plurality of predefined red flag words corresponds to the context of the grouping of transcribed words surrounding the first transcribed word, classify the transcription as being associated with a first category.
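
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the function name `classify_transcription`, the callables `predict_word`, `embed`, and `fits_context`, the Euclidean distance metric, the context window size, and all toy data are assumptions introduced here for illustration only. The claim does not specify a distance metric or how correspondence to the context is decided.

```python
import math
from typing import Callable, List, Optional, Sequence


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Distance between two numerical representations (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def classify_transcription(
    words: List[str],
    confidences: List[float],          # per-word ASR confidence scores
    confidence_levels: List[float],    # predefined level for each word
    predict_word: Callable[[List[str], int], str],   # hypothetical ML model
    embed: Callable[[str], Sequence[float]],         # hypothetical embedding
    fits_context: Callable[[List[str]], bool],       # hypothetical context check
    red_flags: List[str],
    distance_threshold: float,
    window: int = 2,                   # assumed size of the surrounding grouping
) -> Optional[str]:
    """Return "first category" if a red flag word fits a low-confidence slot."""
    for i, word in enumerate(words):
        # Skip words that exceed their predefined confidence level.
        if confidences[i] > confidence_levels[i]:
            continue
        # Predict a replacement and compare the two representations.
        predicted = predict_word(words, i)
        if euclidean(embed(word), embed(predicted)) <= distance_threshold:
            continue
        # Iteratively substitute each red flag word into the surrounding grouping.
        lo, hi = max(0, i - window), min(len(words), i + window + 1)
        for flag in red_flags:
            candidate = words[lo:i] + [flag] + words[i + 1:hi]
            if fits_context(candidate):
                return "first category"
    return None


# Toy stand-ins, purely illustrative.
toy_vectors = {"clothes": (0.0, 0.0), "open": (3.0, 4.0)}
known_phrases = {"want to close my account"}

words = ["i", "want", "to", "clothes", "my", "account"]
result = classify_transcription(
    words=words,
    confidences=[0.9, 0.9, 0.9, 0.3, 0.9, 0.9],
    confidence_levels=[0.5] * len(words),
    predict_word=lambda ws, i: "open",
    embed=lambda w: toy_vectors[w],
    fits_context=lambda group: " ".join(group) in known_phrases,
    red_flags=["cancel", "close"],
    distance_threshold=1.0,
)
```

In this toy run, "clothes" falls below its confidence level, its representation lies far from that of the predicted word, and substituting the red flag word "close" yields a grouping that the context check accepts, so the transcription is assigned to the first category.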