| CPC G06V 10/44 (2022.01) [G06V 10/761 (2022.01); G06V 40/176 (2022.01); G06V 40/23 (2022.01)] | 15 Claims |

|
1. An electronic apparatus comprising:
a camera;
a memory storing one or more instructions; and
at least one processor configured to execute the one or more instructions stored in the memory, wherein the at least one processor, by executing the one or more instructions, is further configured to:
obtain feature information corresponding to each of a plurality of image frames using the at least one network model,
identify one of the plurality of image frames as a best image frame, based on feature information corresponding to the best image frame recognized by the at least one network model, and
provide the identified best image frame,
wherein the at least one network model is a model trained to output feature information corresponding to an input image frame based on facial expression feature and body feature of a person included in the input image frame, and
wherein the plurality of image frames include image frames obtained by the camera during a predetermined first time period before a user selection of a user interface button to capture an image is received and image frames obtained by the camera during a predetermined second time period after the user selection is received.
|