CPC G06N 3/08 (2013.01) [G06N 3/0454 (2013.01); G06N 3/084 (2013.01)] | 15 Claims |
1. An information processing apparatus comprising:
one or more processors and one or more memories,
wherein the one or more processors performs, by executing programs stored in the one or more memories:
determining a plurality of three-dimensional blocks in which a four-dimensional weight parameter between a L layer and a layer next to the L layer of a neural network is divided, wherein the plurality of three-dimensional blocks are blocks in which a feature channel of the four-dimensional weight parameter having a number of channels based on a number of feature channels in the L layer and a number of feature channels in the next layer of the L layer of the neural network is divided by an integer value; and
encoding the four-dimensional weight parameter by approximating the plurality of three-dimensional blocks respectively by a linear combination of (a) codebook coefficients, and (b) two or more different three-dimensional codebook vectors, selected from a set of three-dimensional codebook vectors having a same channel size as the three-dimensional block, wherein the two or more different three-dimensional codebook vectors are part of a set of three-dimensional codebook vectors,
wherein three-dimensional codebook vectors are a predetermined number of three-dimensional codebook vectors selected with priority given to codebook vectors having larger absolute value.
|