US 12,266,379 B2
Relaxed instance frequency normalization for neural-network-based audio processing
Byeonggeun Kim, Seoul (KR); Seunghan Yang, Incheon (KR); Hyunsin Park, Gwangmyeong (KR); Juntae Lee, Seoul (KR); and Simyung Chang, Suwon (KR)
Assigned to QUALCOMM Incorporated, San Diego, CA (US)
Filed by QUALCOMM Incorporated, San Diego, CA (US)
Filed on Oct. 3, 2022, as Appl. No. 17/937,765.
Claims priority of provisional application 63/252,100, filed on Oct. 4, 2021.
Prior Publication US 2023/0119791 A1, Apr. 20, 2023
Int. Cl. G10L 21/034 (2013.01); G10L 17/04 (2013.01); G10L 17/18 (2013.01); G10L 25/30 (2013.01); G10L 25/51 (2013.01)
CPC G10L 21/034 (2013.01) [G10L 17/04 (2013.01); G10L 17/18 (2013.01); G10L 25/30 (2013.01); G10L 25/51 (2013.01)] 27 Claims
OG exemplary drawing
 
1. A processor-implemented method comprising:
receiving an audio input;
generating a relaxed frequency-normalized version of the audio input comprises:
calculating one or more statistical measures for one or more hidden features in each of a plurality of feature dimensions in the received audio input; and
generating an instance-frequency-normalized version of the received audio input based on the calculated one or more statistical measures;
generating a classification of the received audio input using the relaxed frequency-normalized version of the audio input and a neural network trained to classify audio into one of a plurality of categories; and
taking one or more actions based on the classification of the received audio input.