US 12,456,076 B2
	Activation based dynamic network pruning
Minhoo Kang, Seongnam-si (KR)
Assigned to REBELLIONS INC., Seongnam-si (KR)
Filed by REBELLIONS INC., Seongnam-si (KR)
Filed on Dec. 12, 2023, as Appl. No. 18/537,689.
Claims priority of application No. 10-2023-0068299 (KR), filed on May 26, 2023.
Prior Publication US 2024/0394594 A1, Nov. 28, 2024
Int. Cl. G06N 20/00 (2019.01)

CPC G06N 20/00 (2019.01)

11 Claims

1. A method for controlling operations of a machine learning model, the method being performed by one or more processors and comprising:

receive a first command to statistically determine a threshold for skipping an operation associated with a node and continuously maintain the threshold;

receive a second command to dynamically determine the threshold for skipping the operation associated with the node;

dynamically determining the threshold for skipping the operation associated with the node based on a distribution of a plurality of activation values associated with at least one layer included in the machine learning model, the machine learning model comprising a plurality of layers including an input layer that receives an input signal and an output layer that outputs an output signal and a plurality of hidden layers positioned between the input layer and the output layer to receive a signal from the input layer, extract features, and transmit the features to the output layer, wherein the plurality of layers includes a first layer including a first set of nodes and a second layer including a second set of nodes;

acquiring an activation value associated with at least one layer included in the machine learning model;

determining whether the activation value is less than the threshold; and

if the activation value is less than the threshold, controlling the operations of the machine learning model such that an operation associated with the activation value is skipped in the machine learning model,

wherein the activation value includes an output value from the first layer included in the machine learning model, and

the controlling the operations of the machine learning model includes, if operations associated with the second layer into which an output value from the first layer is input are performed and the output value from the first layer is less than the threshold, controlling the operations of the machine learning model by transmitting a skip command associated with the output value and storing the output value in a memory without changing the output value to zero such that an operation associated with the output value from the first layer is skipped in the second layer, performing the machine learning model by performing operations associated with the second set of nodes of the second layer and associated with values unassociated with the skip command and skipping the operations associated with the second set of nodes of the second layer and associated with the output value associated with the skip command regardless of whether the output value stored in the memory is zero;

wherein the determining the threshold includes:

when the activation value is less than the threshold, decreasing the threshold by a predetermined amount;

increasing a counter value indicating a time during which the threshold does not change when it is determined that the activation value is equal to or greater than the threshold; and

if the counter value reaches a predetermined value, resetting the counter value and updating the threshold such that the threshold is increased by the predetermined amount.