| CPC G10L 17/18 (2013.01) [G06N 3/08 (2013.01); G10L 17/02 (2013.01)] | 12 Claims | 

| 
               1. A neural network-based signal processing apparatus comprising: 
            at least one memory storing instructions; and 
                at least one processor configured to execute the instructions to: 
                receive multi-dimension features containing two or more two-dimension feature maps, where the multi-dimension features are labeled; 
                produce a first weight matrix including a plurality of first attentive weights for a plurality of elements, respectively, in the multi-dimension features by using a time-attentive neural network; 
                produce a second weight matrix including a plurality of second attentive weights for the plurality of elements, respectively, in the multi-dimensional features by using a channel-frequency-attentive neural network; 
                produce low-dimension features or posterior probabilities for designated classes, based on the multi-dimension features and the first and second attention weights; 
                multiply the first and second weight matrices and the multi-dimensional features; 
                train an attention network jointly with a classification network, using the multi-dimension features as labeled and after multiplication; 
                receive input data; and 
                classify the input data using the attention network and the classification network that have been trained. 
               |