| CPC G10L 21/04 (2013.01) | 20 Claims |

|
1. A computer-implemented system, comprising:
a memory that stores computer executable components; and
a processor that executes at least one of the computer executable components that:
performs length perturbation of an acoustic utterance comprising a group of frames in a sequence, wherein the performing the length perturbation comprises:
random sampling a first defined percentage of frames from the group of frames resulting in a first subset of drop frames;
for each drop frame, removing a first defined quantity of consecutive frames from the group of frames starting with the drop frame;
random sampling a second defined percentage of frames from the group of frames resulting in a second subset of insert frames; and
for each insert frame, inserting a second defined quantity of replacement frames into the acoustic utterance after the insert frame.
|