| CPC G06V 10/771 (2022.01) [G06T 5/20 (2013.01)] | 9 Claims |

|
1. An information processing apparatus comprising at least one memory configured to store an instruction, and at least one processor configured to execute the instruction to:
use at least one mask channel derived from one or more input feature maps of a set to mask pixels of at least one feature channel derived from the one or more input feature maps of the set and to generate at least one masked feature channel;
perform a convolution operation between the at least one masked feature channel and convolution kernels to generate output feature maps;
calculate a task loss from a prediction and groundtruth data of an image;
calculate a mask loss from mask channels of the output feature maps and groundtruth mask of the image;
calculate a total loss from the task loss and the mask loss; and
train a convolutional neural network based on the total loss to obtain an updated convolutional neural network.
|
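The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation; it only assumes one plausible reading of the claim. The function names, the element-wise multiplicative masking, the choice of mean squared error for the task loss, binary cross-entropy for the mask loss, and the weighted sum with a hypothetical `mask_weight` parameter are all assumptions that the claim does not specify. The gradient-based training step is left out, since it would normally be handled by an automatic differentiation framework.

```python
import math

def apply_mask(feature, mask):
    """Mask the pixels of one feature channel with one mask channel.
    Assumption: masking is an element-wise product."""
    return [[f * m for f, m in zip(frow, mrow)]
            for frow, mrow in zip(feature, mask)]

def conv2d_valid(channel, kernel):
    """Single-channel 'valid' cross-correlation, as used in CNN layers."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(channel) - kh + 1
    out_w = len(channel[0]) - kw + 1
    return [[sum(channel[i + a][j + b] * kernel[a][b]
                 for a in range(kh) for b in range(kw))
             for j in range(out_w)]
            for i in range(out_h)]

def mean_squared_error(pred, target):
    """Task loss (assumed form): mean squared error over all pixels."""
    diffs = [(p - t) ** 2
             for prow, trow in zip(pred, target)
             for p, t in zip(prow, trow)]
    return sum(diffs) / len(diffs)

def binary_cross_entropy(prob, target, eps=1e-7):
    """Mask loss (assumed form): mean binary cross-entropy between a
    predicted mask channel (probabilities) and a groundtruth mask."""
    total, count = 0.0, 0
    for prow, trow in zip(prob, target):
        for p, t in zip(prow, trow):
            p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
            total -= t * math.log(p) + (1 - t) * math.log(1.0 - p)
            count += 1
    return total / count

def total_loss(task_loss, mask_loss, mask_weight=1.0):
    """Total loss (assumed form): weighted sum of the two losses."""
    return task_loss + mask_weight * mask_loss

# Toy data, chosen only for illustration.
feature = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
mask = [[1, 0, 1], [1, 1, 0], [0, 1, 1]]
kernel = [[1, 0], [0, 1]]

masked = apply_mask(feature, mask)        # masked feature channel
output = conv2d_valid(masked, kernel)     # output feature map

task = mean_squared_error(output, [[6, 0], [12, 12]])
mask_l = binary_cross_entropy([[0.5, 0.5], [0.5, 0.5]], [[1, 0], [1, 0]])
loss = total_loss(task, mask_l, mask_weight=0.5)
print(masked, output, task, mask_l, loss)
```

In a real network the mask channel and the feature channel would both be produced by earlier layers from the same input feature maps, and the total loss would be back-propagated to update the convolution kernels.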