CPC G06T 5/77 (2024.01) [G06T 3/18 (2024.01); G06T 3/4046 (2013.01); G06T 5/50 (2013.01); G06T 7/30 (2017.01); G06T 7/50 (2017.01); G06T 7/90 (2017.01); G06T 2207/20084 (2013.01); G06T 2207/20221 (2013.01)] | 20 Claims |
1. A non-transitory computer-readable medium comprising instructions that, when executed by at least one processor, cause a computing device to:
generate, utilizing a trained depth prediction network, a monocular depth prediction for a source image of an object or a scene;
determine a relative camera matrix between a target image of the object or the scene and the source image based on a plurality of matching correspondence points between the source image and the target image, wherein the source image differs from the target image;
determine a rescaled depth prediction based on the monocular depth prediction and the relative camera matrix; and
generate a reprojected image comprising at least a portion of the source image warped based on the rescaled depth prediction and the relative camera matrix to align with the target image.
|
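The rescaling and reprojection steps recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation: the claim does not say how the rescaled depth is computed. The sketch assumes a pinhole camera with known intrinsics `(fx, fy, cx, cy)`, a relative camera matrix already decomposed into a rotation `R` and a translation `t`, and a single global scale factor fitted by least squares so that the matched source points land on their target matches. All function names (`backproject`, `project`, `estimate_depth_scale`, `warp_pixel`) are hypothetical.

```python
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]
Pixel = Tuple[float, float]
Intrinsics = Tuple[float, float, float, float]  # fx, fy, cx, cy (assumed pinhole model)
Mat3 = Sequence[Sequence[float]]


def backproject(pixel: Pixel, depth: float, intr: Intrinsics) -> Vec3:
    """Lift a pixel with a depth value to a 3D point in its camera frame."""
    fx, fy, cx, cy = intr
    u, v = pixel
    return ((u - cx) / fx * depth, (v - cy) / fy * depth, depth)


def project(point: Vec3, intr: Intrinsics) -> Pixel:
    """Project a 3D camera-frame point to pixel coordinates."""
    fx, fy, cx, cy = intr
    x, y, z = point
    return (fx * x / z + cx, fy * y / z + cy)


def rotate(R: Mat3, p: Vec3) -> Vec3:
    """Apply a 3x3 rotation matrix to a 3D point."""
    return tuple(sum(R[i][j] * p[j] for j in range(3)) for i in range(3))


def cross(a: Vec3, b: Vec3) -> Vec3:
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a: Vec3, b: Vec3) -> float:
    return sum(x * y for x, y in zip(a, b))


def estimate_depth_scale(src_pts, tgt_pts, depths, R: Mat3, t: Vec3,
                         intr: Intrinsics) -> float:
    """Assumed rescaling rule: find the scalar s minimising the residual of
    ray x (s * R X + t) = 0 over all matches, where X is the source point
    lifted with the monocular depth and ray is the target viewing ray."""
    num = den = 0.0
    for src, tgt, d in zip(src_pts, tgt_pts, depths):
        a = rotate(R, backproject(src, d, intr))
        ray = backproject(tgt, 1.0, intr)  # target viewing ray at unit depth
        ca, cb = cross(ray, a), cross(ray, t)
        num += dot(ca, cb)
        den += dot(ca, ca)
    return -num / den


def warp_pixel(src: Pixel, depth: float, scale: float, R: Mat3, t: Vec3,
               intr: Intrinsics) -> Pixel:
    """Reproject one source pixel into the target view using rescaled depth."""
    p = rotate(R, backproject(src, scale * depth, intr))
    return project((p[0] + t[0], p[1] + t[1], p[2] + t[2]), intr)
```

Applying `warp_pixel` to every source pixel, followed by resampling, would yield a reprojected image in the sense of the final claim element. A per-pixel or robust (outlier-rejecting) rescaling could replace the single global scale assumed here.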