CPC G06T 7/223 (2017.01) [G06N 5/01 (2023.01); G06V 10/761 (2022.01); G06V 20/46 (2022.01); G06V 20/48 (2022.01); G06V 20/70 (2022.01)] | 17 Claims |
1. A method comprising:
receiving compressed video data;
extracting macroblocks and motion vectors for a plurality of frames in the compressed video data;
identifying frame-level features for each of the plurality of frames based on the macroblocks and the motion vectors;
calculating similarity features for each of the identified frame-level features based on the frame-level features identified in consecutive frames;
predicting motion for each of the plurality of frames by providing the frame-level features and the similarity features into a model trained to detect motion; and
predicting event boundaries in the compressed video data by providing the frame-level features into a second model trained to identify event boundaries.
|
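The steps of claim 1 can be illustrated with a minimal sketch, offered only as one possible reading and not as the claimed implementation. It assumes the macroblocks and motion vectors have already been extracted from the compressed stream into NumPy arrays (the decoding step is not shown). The particular features, the similarity formula, the thresholds, and all function names are hypothetical, and both "models" are simple threshold stand-ins for the trained models the claim recites.

```python
import numpy as np


def frame_features(mb_types, motion_vectors):
    """Frame-level features from macroblocks and motion vectors.

    mb_types: (F, M) int array; 1 marks an intra-coded macroblock (assumed encoding).
    motion_vectors: (F, M, 2) float array of per-macroblock (dx, dy).
    Returns an (F, 3) array: mean magnitude, max magnitude, intra fraction.
    """
    magnitude = np.linalg.norm(motion_vectors, axis=-1)  # shape (F, M)
    return np.stack(
        [
            magnitude.mean(axis=1),
            magnitude.max(axis=1),
            (mb_types == 1).mean(axis=1),
        ],
        axis=1,
    )


def similarity_features(feats, eps=1e-8):
    """Per-feature similarity between consecutive frames, in [0, 1].

    Hypothetical formula: 1 - |curr - prev| / (|curr| + |prev| + eps).
    The first frame has no predecessor and is assigned a similarity of 1.
    """
    prev, curr = feats[:-1], feats[1:]
    sims = 1.0 - np.abs(curr - prev) / (np.abs(curr) + np.abs(prev) + eps)
    return np.concatenate([np.ones((1, feats.shape[1])), sims], axis=0)


def predict_motion(feats, sims, threshold=1.5):
    """Stand-in for the first trained model: uses features AND similarities."""
    score = feats[:, 0] + (1.0 - sims[:, 0])
    return score > threshold


def predict_boundaries(feats, intra_threshold=0.5):
    """Stand-in for the second trained model: uses frame-level features only."""
    return feats[:, 2] > intra_threshold


def run_pipeline(mb_types, motion_vectors):
    """Chain the steps: features, similarities, motion, event boundaries."""
    feats = frame_features(mb_types, motion_vectors)
    sims = similarity_features(feats)
    return predict_motion(feats, sims), predict_boundaries(feats)
```

Note that, mirroring the claim, the motion stand-in receives both the frame-level and similarity features, while the boundary stand-in receives only the frame-level features. In practice each stand-in would be replaced by a trained model exposing the same inputs and outputs.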