CPC G02B 27/0172 (2013.01) [G06N 3/08 (2013.01); G06T 7/33 (2017.01); G06T 7/74 (2017.01); G06V 10/40 (2022.01); G06T 2207/10016 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01); G06T 2207/30244 (2013.01)] | 17 Claims |
1. A method of training a neural network for performing visual odometry, the method comprising:
receiving, by the neural network implemented using one or more computer systems, a plurality of images of an environment;
determining, by the neural network, for each image, a respective set of interest points and a respective descriptor;
determining, by the neural network, a correspondence between the plurality of images based on the sets of interest points and the descriptors of the plurality of images, wherein determining the correspondence between the plurality of images comprises:
determining one or more point correspondences between the sets of interest points, wherein each point correspondence indicates a respective correspondence between an interest point of one image and an interest point of another image, and
determining, based on the one or more point correspondences, a set of candidate interest points, each candidate interest point indicating a respective feature in the environment in three-dimensional space; and
determining, for each candidate interest point, a respective stability metric, including determining that a first candidate interest point is stable, wherein determining that the first candidate interest point is stable comprises:
determining that the first candidate interest point is depicted in a number of images of the plurality of images exceeding a first threshold number, and
determining that a re-projection error associated with the first candidate interest point is less than a first threshold error level; and
modifying the neural network based on the one or more candidate interest points.
|