| CPC B25J 9/1697 (2013.01) [B25J 9/1664 (2013.01); B25J 9/1669 (2013.01); G06T 7/0002 (2013.01); G06T 7/70 (2017.01); G06T 7/74 (2017.01); G06V 10/443 (2022.01); G06V 10/751 (2022.01); G06V 10/7515 (2022.01); G06V 10/7715 (2022.01); G06V 10/776 (2022.01); G06V 20/50 (2022.01); G06V 20/653 (2022.01); G06T 2207/30168 (2013.01); G06V 2201/06 (2022.01)] | 20 Claims |

|
1. A computing system comprising:
at least one processing circuit in communication with a robot, having an arm and an end-effector connected thereto, and a camera having a field of view and configured, when one or more objects are or have been in the field of view, to execute instructions stored on a non-transitory computer-readable medium for:
obtaining object image information of an object in a scene;
generating one or more detection hypotheses, wherein the one or more detection hypotheses correspond to an object recognition template representing a template object; and
validating each detection hypothesis of the one or more detection hypotheses by:
generating one or more three-dimensional validation scores based on comparing three-dimensional information of the object recognition template of the one or more detection hypotheses and three-dimensional information of the object image information corresponding to the object, the one or more three-dimensional validation scores including at least one of an occlusion validator score, a three-demensional information validator score, a hole matching validator score, and a normal vector validator score;
generating one or more two-dimensional validation scores based on comparing two-dimensional information of the corresponding object recognition template of the one or more detection hypotheses and two-dimensional information of the object image information, the one or more two-dimensional validation scores including at least one of a rendered match validator score and a template match validator score;
selecting a detection hypothesis from the one or more detection hypotheses according to the one or more three-dimensional validation scores and the one or more two-dimensional validation scores; and
generating a detection result for the object in the scene according to the selected detection hypothesis.
|