| CPC G10L 25/30 (2013.01) [G06N 3/0464 (2023.01); G10L 25/84 (2013.01)] | 19 Claims | 

| 
                1. A computing system implementing a convolutional neural network (CNN) architecture, the CNN architecture comprising a multi-scale input block and a multi-scale nested block, wherein the multi-scale input block is configured to:
                    receive input data based on an audio signal; and
                    generate a first downsampled input data set by downsampling the input data;
                wherein the multi-scale nested block comprises:
                    a first encoding layer configured to generate a first encoded data set by performing a convolution based on the input data,
                    a second encoding layer configured to generate a second encoded data set by performing a convolution based on the first downsampled input data set,
                    a first convolutional layer configured to generate a first output data set by performing a convolution based on the first encoded data set and an upsampled second encoded data set, wherein the upsampled second encoded data set is obtained by upsampling the second encoded data set, and
                    a second convolutional layer configured to generate a second output data set based on the first output data set;
                wherein the computing system is configured to output an output audio signal based on the second output data set.
               |
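
The data flow recited in claim 1 can be illustrated with a minimal sketch in plain Python. It is not the claimed implementation: the claim fixes neither the downsampling factor, the kernel sizes, the upsampling method, nor how the first encoded data set and the upsampled second encoded data set are combined. The sketch assumes a one-dimensional signal, decimation by 2, nearest-neighbour upsampling, an elementwise sum as the combining step, and single-channel kernels with zero padding. All function and parameter names (`conv1d_same`, `downsample2`, `upsample2`, `multi_scale_forward`, `k_enc1`, and so on) are hypothetical.

```python
def conv1d_same(x, kernel):
    """1-D sliding dot product with zero padding; output has len(x) samples.

    Assumes an odd-length kernel (illustrative choice, not from the claim).
    """
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(x) + [0.0] * pad
    return [
        sum(k * padded[i + j] for j, k in enumerate(kernel))
        for i in range(len(x))
    ]


def downsample2(x):
    """Keep every second sample (assumed downsampling scheme)."""
    return list(x[::2])


def upsample2(x, length):
    """Repeat each sample twice, then trim to `length` samples."""
    repeated = [v for v in x for _ in range(2)]
    return repeated[:length]


def multi_scale_forward(signal, k_enc1, k_enc2, k_conv1, k_conv2):
    # Multi-scale input block: input data and a first downsampled copy.
    x0 = list(signal)
    x1 = downsample2(x0)

    # Multi-scale nested block: one encoding layer per scale.
    e1 = conv1d_same(x0, k_enc1)   # first encoded data set
    e2 = conv1d_same(x1, k_enc2)   # second encoded data set

    # Bring the coarse scale back to the fine resolution.
    e2_up = upsample2(e2, len(e1))

    # Assumption: combine the two scales by elementwise sum.
    merged = [a + b for a, b in zip(e1, e2_up)]

    y1 = conv1d_same(merged, k_conv1)  # first output data set
    y2 = conv1d_same(y1, k_conv2)      # second output data set
    return y2


if __name__ == "__main__":
    identity = [0.0, 1.0, 0.0]
    print(multi_scale_forward([1.0, 2.0, 3.0, 4.0],
                              identity, identity, identity, identity))
```

With identity kernels, the result is simply the input plus its decimated-then-upsampled copy, which makes the two-scale path easy to inspect. A practical system would use learned multi-channel kernels and nonlinearities, and would derive the output audio signal from the second output data set.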