| CPC G06T 7/11 (2017.01) [G06V 10/778 (2022.01); G06V 10/82 (2022.01); G06V 20/70 (2022.01); G06T 2207/20021 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01)] | 20 Claims |

|
1. A method comprising:
receiving an image depicting an object;
generating an atrous query matrix for the image by performing an atrous convolution operation based on a plurality of dilation rates;
generating image features for the image by performing an atrous self-attention operation based on the atrous query matrix; and
generating label data that identifies the object based on the image features.
|