| CPC G06T 7/11 (2017.01) [G06T 7/73 (2017.01); G06V 10/7715 (2022.01); G06V 10/806 (2022.01); G06V 10/82 (2022.01); G06V 20/70 (2022.01); G06T 2207/20016 (2013.01); G06T 2207/20084 (2013.01)] | 18 Claims |

|
1. A method for segmenting objects in a scene by an electronic device, the method comprising:
inputting at least one input frame of the scene into a pre-trained neural network model, the scene comprising a plurality of objects;
determining a position and a shape of each object of the plurality of objects in the scene using the pre-trained neural network model;
determining an array of coefficients for pixels associated with each object of the plurality of objects in the scene using the pre-trained neural network model; and
generating a segment mask for each object of the plurality of objects based on the position, the shape, and the array of coefficients for each object of the plurality of objects in the scene,
wherein the generating the segment mask for each object of the plurality of objects comprises:
obtaining semantically aware center maps and shape aware prototype masks associated with each object of the plurality of objects in the scene,
determining a linear combination of the semantically aware center maps and the shape aware prototype masks weighted by corresponding coefficients of the array of coefficients on each center location, and
generating the segment mask for each object of the plurality of objects based on the linear combination of the semantically aware center maps and the shape aware prototype masks.
|