CPC G06V 10/751 (2022.01) [G06F 18/214 (2023.01); G06F 18/25 (2023.01); G06N 3/08 (2013.01)] | 20 Claims |
1. A non-transitory computer-readable medium storing instructions that, when executed by at least one processor, cause a computing device to:
generate an image-object feature map reflecting attributes from a digital image portraying an object utilizing an embedding neural network;
generate a localized object attention feature vector reflecting a segmentation prediction of the object portrayed in the digital image from the image-object feature map utilizing a localizer neural network; and
determine a plurality of attributes for the object portrayed within the digital image from a combination of the localized object attention feature vector and the image-object feature map utilizing a classifier neural network.
|