CPC G06V 10/771 (2022.01) [G06T 7/70 (2017.01); G06T 17/00 (2013.01); G06V 10/44 (2022.01); G06V 10/751 (2022.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01)] | 20 Claims |
1. A method of training a neural network to predict correspondences in images, the method comprising:
generating, by one or more processors of a processing system and using the neural network, a first feature map based on a first image of a subject, and a second feature map based on a second image of the subject, the first image and the second image being different and having been generated using a three-dimensional model of the subject;
determining, by the one or more processors, a first feature distance between a first point as represented in the first feature map and a second point as represented in the second feature map, the first point and the second point corresponding to the same feature on the three-dimensional model of the subject;
determining, by the one or more processors, a second feature distance between a third point as represented in the first feature map and a fourth point as represented in the first feature map;
determining, by the one or more processors, a first geodesic distance between the third point and the fourth point as represented in a first surface map, the first surface map corresponding to the first image and having been generated using the three-dimensional model of the subject;
determining, by the one or more processors, a third feature distance between the third point as represented in the first feature map and a fifth point as represented in the first feature map;
determining, by the one or more processors, a second geodesic distance between the third point and the fifth point as represented in the first surface map;
determining, by the one or more processors, a first loss value of a set of loss values, the first loss value being based on the first feature distance;
determining, by the one or more processors, a second loss value of the set of loss values, the second loss value being based on the second feature distance, the third feature distance, the first geodesic distance, and the second geodesic distance; and
modifying, by the one or more processors, one or more parameters of the neural network based at least in part on the set of loss values.
|