CPC G06T 7/11 (2017.01) [G06N 3/04 (2013.01); G06T 7/149 (2017.01); G06T 7/168 (2017.01); G06T 7/174 (2017.01); G06V 10/82 (2022.01); G06V 20/46 (2022.01); G06T 2207/10016 (2013.01); G06T 2207/20084 (2013.01)] | 12 Claims |
1. A method with video segmentation, comprising: acquiring, overtime, a video sequence comprising a plurality of image frames, the plurality of image frames including a second image frame corresponding to a time t of the video sequence and a first image frame corresponding to a time t−1 before the time t;
extracting a second feature vector from the second image frame;
generating second hidden state information corresponding to the second image frame, by using information related to the second image frame based on a relation between a first feature vector corresponding to at least one object included in the second image frame stored in a memory and the second feature vector, wherein the information related to the second image frame is determined to have a predetermined relation with the second image frame from among hidden state information of a plurality of the image frames corresponding to a time before the time t;
generating a second segmentation mask corresponding to the second image frame, based on an output vector corresponding to the second hidden state information;
determining a dissimilarity based on an entropy-based correlation between the hidden state information stored in the memory and the second feature vector;
storing the second hidden state information in the memory, based on a result of comparing the dissimilarity to a preset reference value; and
outputting the second segmentation mask.
|