CPC G10L 15/01 (2013.01) [G06F 3/0484 (2013.01); G10L 13/047 (2013.01); G10L 13/08 (2013.01); G10L 15/02 (2013.01); G10L 15/063 (2013.01); G10L 15/22 (2013.01); G10L 21/0216 (2013.01); G10L 2015/025 (2013.01)] | 20 Claims |
1. A system comprising:
a memory circuitry for storing computer instructions;
a network interface circuitry; and
a processor in communication with the network interface circuitry and the memory circuitry, the processor configured to execute the computer instructions from the memory circuitry to:
receive speech samples uttered by a plurality of speakers;
determine a reference textual passage;
convert the reference textual passage into a set of machine-generated speeches corresponding to the plurality of speakers by automatically processing the reference textual passage and the speech samples using an automatic neural voice cloning model;
process the set of machine-generated speeches to produce a set of transcribed texts using at least one Automatic Speech Recognition (ASR) model; and
automatically quantify a bias in the at least one ASR model based on the set of transcribed texts and the reference textual passage.
|