| CPC G06N 3/08 (2013.01) [G06N 3/04 (2013.01)] | 16 Claims |

|
1. A computer-implemented method for reducing resource requirements for a neural model comprising:
given a trained multi-dimensional neural network comprising one or more neural network (NN) layers:
for each neural network layer from a set of one or more neural network layers from the trained multi-dimensional neural network, using tensor ring (TR) more neural network layers from the trained multi-dimensional neural network, using tensor ring (TR) decomposition to approximate the neural network layer to obtain a TR-decomposed NN layer and to obtain a corresponding TR-decomposed multi-dimensional neural network that comprises one or more TR-decomposed NN layers;
using validation data on the TR-decomposed NN layer or on the TR-decomposed multi-dimensional neural network to determine if an output from the TR-decomposed NN layer or the TR-decomposed multi-dimensional neural network is within an acceptable threshold as compared to an output from a corresponding NN layer from the trained multi-dimensional neural network or from the trained multi-dimensional neural network;
responsive to the output not meeting an acceptable threshold value:
increasing a rank for TR-decomposition of one or more of the NN layers in the set of one or more neural network layers; and
returning to the step of, for each neural network layer from a set of one or more neural network layers from the trained multi-dimensional neural network, using tensor ring (TR) decomposition to approximate the neural network layer to obtain a TR-decomposed NN layer and to obtain a corresponding TR-decomposed multi-dimensional neural network that comprises one or more TR-decomposed NN layers; and
responsive to the output meeting an acceptable threshold value:
re-training the TR-decomposed multi-dimensional neural network using a training dataset until a stop condition is reached; and
outputting the re-trained TR-decomposed multi-dimensional neural network;
wherein the trained multi-dimensional neural network is a trained three-dimensional convolutional neural network comprising one or more convolutional neural network layers and the method comprises:
given the trained three-dimensional convolutional neural network (3D CNN) comprising one or more convolutional neural network (CNN) layers:
for each CNN layer from a set of one or more CNN layers from the trained 3D CNN, using tensor ring (TR) decomposition to approximate the CNN layer to obtain a TR-decomposed CNN layer and to obtain a corresponding TR-decomposed 3D CNN that comprises one or more TR-decomposed CNN layers;
using validation data on the TR-decomposed CNN layer or on the TR-decomposed 3D CNN to determine if an output is within an acceptable threshold as compared to an output from a corresponding CNN layer from the trained 3D CNN or from the trained 3D CNN;
responsive to the output not meeting an acceptable threshold value:
increasing a rank for TR-decomposition of one or more of the CNN layers in the set of one or more CNN layers; and
returning to the step of, for each CNN layer from a set of one or more CNN layers from the trained 3D CNN, using tensor ring (TR) decomposition to approximate the CNN layer to obtain a TR-decomposed CNN layer and to obtain a corresponding TR-decomposed 3D CNN that comprises one or more TR-decomposed CNN layers; and
responsive to the output meeting an acceptable threshold value:
re-training the TR-decomposed 3D CNN using a training dataset until a stop condition is reached; and
outputting the re-trained TR-decomposed 3D CNN, which requires less memory to store than the trained 3D CNN.
|