| CPC G06T 17/20 (2013.01) [G06T 7/73 (2017.01); G06T 13/40 (2013.01); G06T 19/006 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20132 (2013.01); G06T 2207/30201 (2013.01)] | 20 Claims |

|
1. A method comprising:
accessing a monocular image depicting an object;
predicting both a volumetric reconstruction tensor of the monocular image and a pose of the object by applying a first machine learning model to the monocular image;
identifying a portion of the pose of the object that corresponds to a point in a canonical space associated with a set of position encoding information;
obtaining a point of the volumetric reconstruction tensor corresponding to the identified portion of the pose;
classifying the obtained point as being inside or outside of a canonical volume by applying a second machine learning model to the obtained point of the volumetric reconstruction tensor together with the set of position encoding information; and
generating a three-dimensional (3D) mesh representing the object in the canonical space in response to classifying the obtained point as being inside or outside of the canonical volume.
|