CPC G06V 20/48 (2022.01) [G06F 16/783 (2019.01); G06V 10/761 (2022.01); G06V 10/82 (2022.01); G06V 20/41 (2022.01); G06V 20/46 (2022.01)] | 20 Claims |
1. A method for video frame processing, comprising:
obtaining a convolutional neural network (CNN) feature of a target video frame and a local feature of the target video frame, the local feature of the target video frame comprising a first key point of the target video frame and a feature descriptor corresponding to the first key point;
performing dimension reduction on the CNN feature of the target video frame to obtain a CNN feature with a reduced dimension of the target video frame;
obtaining a first video frame from a plurality of sample video frames, a distance between a CNN feature with a reduced dimension of the first video frame and the CNN feature with the reduced dimension of the target video frame meeting a first preset condition;
obtaining a local feature of the first video frame, the local feature of the first video frame comprising a second key point in the first video frame and a feature descriptor corresponding to the second key point;
calculating a matching degree between the local feature of the first video frame and the local feature of the target video frame; and
determining the first video frame as a duplicate video frame of the target video frame if the matching degree meets a second preset condition.
|