| CPC G06F 3/013 (2013.01) [G06T 7/70 (2017.01); G06V 10/454 (2022.01); G06V 10/62 (2022.01); G06V 10/761 (2022.01); G06V 10/771 (2022.01); G06V 40/171 (2022.01); G06T 2207/30201 (2013.01)] | 20 Claims |

|
1. A method performed by an electronic device, the method comprising:
obtaining target information of an image, the image comprising an eye;
obtaining a target feature map representing information on the eye in the image, by extracting features from a first feature map of at least two frame images and the target information based on an offset between pixels of a face in the image and a first front image obtained by offsetting the pixels of the face in the image and applying a facial mask covering a region other than the face in the image to the image; and
performing gaze estimation for the eye in the image based on the target feature map,
wherein the target information comprises either attention information on the image, or a distance between pixels in the image, or both,
wherein the attention information comprises temporal relationship information between the at least two frame images and frontal facial features of the face or a head, and
wherein the frontal facial features are determined based on obtaining a facial map and the facial mask of the image.
|