US 11,720,786 B2
Information processing apparatus, information processing method, and program
Shunta Tate, Tokyo (JP); Masakazu Matsugu, Yokohama (JP); Yasuhiro Komori, Tokyo (JP); and Takayuki Saruta, Tokyo (JP)
Assigned to Canon Kabushiki Kaisha, Tokyo (JP)
Filed by CANON KABUSHIKI KAISHA, Tokyo (JP)
Filed on Sep. 22, 2017, as Appl. No. 15/713,470.
Claims priority of application No. 2016-188412 (JP), filed on Sep. 27, 2016.
Prior Publication US 2018/0089564 A1, Mar. 29, 2018
Int. Cl. G06N 3/08 (2006.01); G06N 3/04 (2006.01); G06N 3/084 (2023.01)
CPC G06N 3/08 (2013.01) [G06N 3/0454 (2013.01); G06N 3/084 (2013.01)]
15 Claims
 
1. An information processing apparatus comprising:
one or more processors and one or more memories,
wherein the one or more processors performs, by executing programs stored in the one or more memories:
determining a plurality of three-dimensional blocks in which a four-dimensional weight parameter between a L layer and a layer next to the L layer of a neural network is divided, wherein the plurality of three-dimensional blocks are blocks in which a feature channel of the four-dimensional weight parameter having a number of channels based on a number of feature channels in the L layer and a number of feature channels in the next layer of the L layer of the neural network is divided by an integer value; and
encoding the four-dimensional weight parameter by approximating the plurality of three-dimensional blocks respectively by a linear combination of (a) codebook coefficients, and (b) two or more different three-dimensional codebook vectors, selected from a set of three-dimensional codebook vectors having a same channel size as the three-dimensional block, wherein the two or more different three-dimensional codebook vectors are part of a set of three-dimensional codebook vectors,
wherein three-dimensional codebook vectors are a predetermined number of three-dimensional codebook vectors selected with priority given to codebook vectors having larger absolute value.
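For illustration only, the block division and encoding recited in claim 1 can be sketched in NumPy as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the weight layout (output channels, input channels, kernel height, kernel width), the function names split_into_blocks, encode_block and decode_block, and the use of least squares to obtain the codebook coefficients are all hypothetical choices made here. The phrase "priority given to codebook vectors having larger absolute value" is read here as keeping the codebook vectors whose coefficients have the largest magnitude, which is one possible interpretation.

```python
import numpy as np


def split_into_blocks(weight, divisor):
    """Divide the input-feature-channel axis of a 4-D weight by an integer.

    weight: array of shape (c_out, c_in, kh, kw)  (assumed layout)
    Returns an array of shape (c_out * divisor, c_in // divisor, kh, kw),
    i.e. a stack of three-dimensional blocks.
    """
    c_out, c_in, kh, kw = weight.shape
    if c_in % divisor != 0:
        raise ValueError("divisor must divide the number of feature channels")
    block_c = c_in // divisor
    # Each 3-D kernel (c_in, kh, kw) is cut into `divisor` contiguous channel groups.
    return weight.reshape(c_out * divisor, block_c, kh, kw)


def encode_block(block, codebook, num_terms):
    """Approximate one 3-D block by a linear combination of codebook vectors.

    codebook: array of shape (K, block_c, kh, kw), same channel size as the block.
    Returns (indices, coefficients) for `num_terms` selected codebook vectors.
    """
    target = block.ravel()
    basis = codebook.reshape(codebook.shape[0], -1)  # (K, D)
    # Assumption: coefficients for all K vectors come from a least-squares fit.
    dense, *_ = np.linalg.lstsq(basis.T, target, rcond=None)
    # Keep the predetermined number of vectors with the largest |coefficient|.
    indices = np.argsort(-np.abs(dense))[:num_terms]
    # Refit the coefficients using only the selected vectors.
    coeffs, *_ = np.linalg.lstsq(basis[indices].T, target, rcond=None)
    return indices, coeffs


def decode_block(indices, coeffs, codebook):
    """Rebuild the approximated 3-D block from its code."""
    return np.tensordot(coeffs, codebook[indices], axes=1)
```

In this sketch each block is stored as num_terms index and coefficient pairs rather than as its full set of weights, so the storage saving depends on how the block size compares with num_terms and on how large the shared codebook is.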