| CPC G06V 20/46 (2022.01) [G06N 3/08 (2013.01); G06V 10/82 (2022.01)] | 12 Claims |

|
1. An apparatus for predicting a video frame, the apparatus comprising:
a level encoder configured to extract and learn at least one feature from a video frame;
a feature learning unit configured to learn based on the at least one feature or transmit predicted feature data corresponding to the at least one feature; and
a level decoder configured to obtain and learn a predicted video frame based on the predicted feature data,
wherein the level encoder receives first to (T−1)th video frames, respectively, and extracts at least one feature from each of the first to (T−1)th video frames, where “T” includes a natural number equal to or greater than 2,
wherein the feature learning unit is trained based on at least one feature extracted from each of the first to (T−1)th video frames,
wherein the level encoder receives the T-th video frame,
wherein the level decoder obtains a (T+1)th predicted video frame corresponding to the T th video frame,
wherein the level encoder receives the (T+1)th predicted video frame, and
wherein the level decoder obtains a (T+2)th predicted video frame corresponding to the (T+1)th predicted video frame.
|