CPC G06V 20/49 (2022.01) [G06F 18/214 (2023.01); G06F 18/23 (2023.01); G06F 18/251 (2023.01); G06F 40/205 (2020.01); G06V 20/46 (2022.01)] | 20 Claims |
1. A computer-implemented method comprising:
receiving a user input and an input video comprising a plurality of frames;
generating a plurality of segmentation masks for the plurality of frames;
determining a set of reference masks corresponding to the user input and an object;
generating a set of fusion masks by combining the plurality of segmentation masks and the set of reference masks;
propagating the set of fusion masks between the plurality of segmentation masks; and
outputting a final set of masks for the input video.
|