| CPC G06T 7/55 (2017.01) [G06T 3/18 (2024.01); G06T 7/33 (2017.01); G06T 7/80 (2017.01); G05D 1/0088 (2013.01); G05D 1/0214 (2013.01); G05D 1/0223 (2013.01); G05D 1/0251 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/30244 (2013.01); G06T 2207/30252 (2013.01)] | 7 Claims |

|
1. A method comprising:
estimating correspondences between keypoints of a target camera image and keypoints of a context camera image, wherein the target camera image and the context camera image are obtained from a monocular sequence;
using a ray surface decoder to predict a ray surface from the target image, wherein the predicted ray surface associates a respective pixel in the target image with a corresponding direction;
based on the keypoint correspondences and the predicted ray surface, lifting a set of 2D keypoints to 3D, using a neural camera model; and
projecting the 3D keypoints into the context camera image using the neural camera model.
|