US 12,112,767 B2
Acoustic data augmentation with mixed normalization factors
Toru Nagano, Taito-Ku (JP); Takashi Fukuda, Tokyo (JP); and Masayuki Suzuki, Tokyo (JP)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by INTERNATIONAL BUSINESS MACHINES CORPORATION, Armonk, NY (US)
Filed on May 21, 2021, as Appl. No. 17/326,463.
Prior Publication US 2022/0375484 A1, Nov. 24, 2022
Int. Cl. G10L 21/00 (2013.01); G06N 20/00 (2019.01); G10L 19/02 (2013.01); G10L 25/27 (2013.01)
CPC G10L 21/00 (2013.01) [G06N 20/00 (2019.01); G10L 19/02 (2013.01); G10L 25/27 (2013.01)]
20 Claims
 
1. A method for acoustic data augmentation, the method comprising:
obtaining sets of audio data from different sources, wherein each of the respective sets comprises audio values with respect to time values over a respective time period;
calculating a respective normalization factor for at least four sets of the sets of audio data, wherein the audio values of the at least four sets are for a same type of audio value;
calculating a mixed normalization factor by using the at least four calculated normalization factors, wherein the mixed normalization factor is located at any point within a quadrilateral that includes the at least four calculated normalization factors;
normalizing at least two sets of the at least four sets by using the mixed normalization factor, wherein the normalized at least two sets together constitute training data; and
training an acoustic model by using the training data.
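
The steps recited in claim 1 can be illustrated with a minimal sketch. It rests on several assumptions that the claim itself does not state: each normalization factor is taken to be a (mean, standard deviation) pair, the mixed factor is formed as a convex combination of the four factors, and the function names (normalization_factor, mixed_factor, normalize, random_weights) are hypothetical. A convex combination lies in the convex hull of the four factors, which coincides with the quadrilateral having those factors as vertices only when the four points are in convex position. The final training step is omitted because the claim does not fix a model type.

```python
import random
import statistics


def normalization_factor(values):
    """Return an assumed (mean, std) factor for one set of audio values."""
    return (statistics.fmean(values), statistics.pstdev(values))


def mixed_factor(factors, weights):
    """Convex combination of (mean, std) factors.

    Weights must be non-negative and sum to 1, so the result stays
    inside the convex hull of the given factors.
    """
    if len(factors) != len(weights):
        raise ValueError("need one weight per factor")
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must be non-negative and sum to 1")
    mean = sum(w * f[0] for w, f in zip(weights, factors))
    std = sum(w * f[1] for w, f in zip(weights, factors))
    return (mean, std)


def normalize(values, factor):
    """Shift and scale one set of audio values by a (mean, std) factor."""
    mean, std = factor
    return [(v - mean) / std for v in values]


def random_weights(n, rng):
    """Draw n non-negative weights that sum to 1."""
    raw = [rng.random() for _ in range(n)]
    total = sum(raw)
    return [r / total for r in raw]


if __name__ == "__main__":
    rng = random.Random(0)
    # Four synthetic "sources" standing in for sets of audio values over time.
    params = [(0.0, 1.0), (2.0, 0.5), (-1.0, 2.0), (0.5, 1.5)]
    sets = [[rng.gauss(mu, sigma) for _ in range(200)] for mu, sigma in params]

    factors = [normalization_factor(s) for s in sets]
    mixed = mixed_factor(factors, random_weights(len(factors), rng))

    # Normalize at least two of the four sets with the mixed factor;
    # together they would form the training data for an acoustic model.
    training_data = [normalize(s, mixed) for s in sets[:2]]
    print(len(training_data), len(training_data[0]))
```

Drawing fresh weights on each pass would yield a different mixed factor, and hence a differently normalized copy of the same audio, which is one plausible reading of how the method augments the training data.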