CPC G06V 10/25 (2022.01) [G06V 10/82 (2022.01); G06V 10/87 (2022.01)] | 16 Claims |
1. A method of inferring an object in a moving image using a convolutional neural network (CNN) model, performed by an electronic device, the method comprising:
identifying a first region of interest in a first frame among a plurality of frames in the moving image, and a first object in the first region of interest, by providing the first frame to a plurality of convolution layer groups sequentially connected in the CNN model;
identifying a second region of interest in a second frame among the plurality of frames, the second region of interest corresponding to the first region of interest, and the second frame being after the first frame;
providing the second region of interest to the CNN model, and obtaining first output data that is output from a first convolution layer group from among the plurality of convolution layer groups; and
determining whether to identify a second object in the second region of interest by using a second convolution layer group from among the plurality of convolution layer groups, based on the first output data, wherein the second convolution layer group sequentially follows the first convolution layer group within the plurality of convolution layer groups.
|