| CPC G06V 10/52 (2022.01) [G06T 7/11 (2017.01); G06V 10/26 (2022.01); G06V 10/7715 (2022.01)] | 20 Claims |

|
1. A computer vision system comprising:
one or more processors; and
memory comprising instructions that, when executed by the one or more processors, cause the one or more processors to:
determine a semantic multi-scale context feature and an instance multi-scale context feature of an input scene;
generate a joint attention map based on the semantic multi-scale context feature and the instance multi-scale context feature;
refine the semantic multi-scale context feature and instance multi-scale context feature based on the joint attention map; and
generate a panoptic segmentation image based on the refined semantic multi-scale context feature and the refined instance multi-scale context feature.
|
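The four steps recited in claim 1 can be illustrated with a toy sketch. The claim does not specify how any step is computed, so everything here is an assumption made for illustration: single-channel feature maps held as nested lists, window averaging as the multi-scale context, a sigmoid of the summed features as the joint attention map, residual re-weighting as the refinement, and a simple threshold as the panoptic fusion. All function names (`multi_scale_context`, `joint_attention`, `refine`, `fuse_panoptic`) are hypothetical and do not come from the patent.

```python
import math


def window_mean(feat, radius):
    """Mean of `feat` over a (2*radius+1)^2 window, clipped at the borders."""
    h, w = len(feat), len(feat[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            vals = [feat[j][i]
                    for j in range(max(0, y - radius), min(h, y + radius + 1))
                    for i in range(max(0, x - radius), min(w, x + radius + 1))]
            out[y][x] = sum(vals) / len(vals)
    return out


def multi_scale_context(feat, radii=(0, 1, 2)):
    """Assumed multi-scale context: average of window means at several radii."""
    scales = [window_mean(feat, r) for r in radii]
    h, w = len(feat), len(feat[0])
    return [[sum(s[y][x] for s in scales) / len(scales) for x in range(w)]
            for y in range(h)]


def joint_attention(sem_ctx, inst_ctx):
    """Assumed joint attention: per-pixel sigmoid of the summed features."""
    return [[1.0 / (1.0 + math.exp(-(s + i))) for s, i in zip(rs, ri)]
            for rs, ri in zip(sem_ctx, inst_ctx)]


def refine(ctx, attn):
    """Assumed refinement: residual re-weighting, ctx * (1 + attn)."""
    return [[c * (1.0 + a) for c, a in zip(rc, ra)]
            for rc, ra in zip(ctx, attn)]


def fuse_panoptic(sem_ref, inst_ref, thresh=0.5):
    """Placeholder fusion: (class_id, instance_id) per pixel by thresholding."""
    out = []
    for rs, ri in zip(sem_ref, inst_ref):
        row = []
        for s, i in zip(rs, ri):
            cls = 1 if s > thresh else 0                    # 1 = "thing", 0 = "stuff"
            inst_id = 1 if (cls == 1 and i > thresh) else 0  # stuff has no instance
            row.append((cls, inst_id))
        out.append(row)
    return out


# Toy 4x4 input scene: one object occupying the top-left 2x2 block.
sem = [[1.0, 1.0, 0.0, 0.0],
       [1.0, 1.0, 0.0, 0.0],
       [0.0, 0.0, 0.0, 0.0],
       [0.0, 0.0, 0.0, 0.0]]
inst = [row[:] for row in sem]

sem_ctx = multi_scale_context(sem)         # step 1: semantic context feature
inst_ctx = multi_scale_context(inst)       # step 1: instance context feature
attn = joint_attention(sem_ctx, inst_ctx)  # step 2: joint attention map
sem_ref = refine(sem_ctx, attn)            # step 3: refine both features
inst_ref = refine(inst_ctx, attn)
panoptic = fuse_panoptic(sem_ref, inst_ref)  # step 4: panoptic output
```

The point of the sketch is the data flow: both feature branches feed a single attention map, and that one map refines both branches before fusion. A practical system would presumably use learned convolutional features and a trained segmentation head rather than these fixed arithmetic stand-ins.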