| CPC G06N 3/088 (2013.01) [G06F 18/2137 (2023.01); G06N 3/047 (2023.01); G06V 30/274 (2022.01)] | 20 Claims |

|
1. A method of training an interpretable deep learning model for a machine learning system, comprising:
receiving an input set of data;
providing the input set of data to a deep neural network model;
extracting features from the deep neural network model;
generating a latent space of vectors comprising the extracted features;
feeding the latent space of vectors generated from extracted features of the deep neural network model to a task-specific model, wherein the task-specific model is a low-complexity and linear learning model;
generating interpretable predictions of feature dimensions from the task-specific model;
reconstructing the input set of data using a decoder module;
determining a reconstruction error loss from reconstructing the input set of data;
determining a classification loss or a regression loss from a task specific output set of data; and
training an autoencoder, the decoder module, and the low-complexity learning model, using a combination of (i) the reconstruction error loss and (ii) the classification loss or the regression loss.
|