US 11,748,600 B2
Quantization parameter optimization method and quantization parameter optimization device
Yukihiro Sasagawa, Yokohama (JP)
Assigned to SOCIONEXT INC., Kanagawa (JP)
Filed by SOCIONEXT INC., Kanagawa (JP)
Filed on Sep. 8, 2020, as Appl. No. 17/14,699.
Claims priority of application No. 2019-163981 (JP), filed on Sep. 9, 2019.
Prior Publication US 2021/0073635 A1, Mar. 11, 2021
Int. Cl. G06N 3/063 (2023.01); G06N 3/08 (2023.01); G06N 20/10 (2019.01); G06F 18/214 (2023.01); G06F 18/2431 (2023.01); G06F 18/2451 (2023.01); G06F 18/2453 (2023.01)
CPC G06N 3/063 (2013.01) [G06F 18/214 (2023.01); G06F 18/2431 (2023.01); G06F 18/2451 (2023.01); G06F 18/2453 (2023.01); G06N 3/08 (2013.01); G06N 20/10 (2019.01)] 10 Claims
OG exemplary drawing
 
10. A quantization parameter optimization device that determines a quantization parameter that is a weight parameter in a neural network having been quantized, the quantization parameter optimization device comprising:
a cost function determiner that determines a cost function in which a regularization term is added to an error function, the regularization term being a function of a quantization error that is an error between the weight parameter and the quantization parameter, the error function being a function for determining an error between an output value of the neural network and a corresponding teaching data value;
an updater that updates the quantization parameter by use of the cost function; and
a quantization parameter determiner that determines, as an optimized quantization parameter of a quantization neural network, the quantization parameter with which a function value derived from the cost function satisfies a predetermined condition, the optimized quantization parameter being obtained as a result of the updater repeating updating the quantization parameter, the quantization neural network being the neural network, the weight parameter of which has been quantized,
wherein the function value derived from the regularization term and an inference accuracy of the quantization neural network are negatively correlated.