US 11,868,863 B2
Systems and methods for joint learning of complex visual inspection tasks using computer vision
Carlo Dal Mutto, Sunnyvale, CA (US); Francesco Peruch, Sunnyvale, CA (US); Alexander Ou, Sunnyvale, CA (US); and Robert Hayes, Palo Alto, CA (US)
Assigned to Packsize LLC, Salt Lake City, UT (US)
Filed by PACKSIZE LLC, Salt Lake City, UT (US)
Filed on Oct. 13, 2022, as Appl. No. 17/965,428.
Application 17/965,428 is a continuation of application No. 16/721,501, filed on Dec. 19, 2019, granted, now 11,508,050.
Claims priority of provisional application 62/782,163, filed on Dec. 19, 2018.
Prior Publication US 2023/0177400 A1, Jun. 8, 2023
Int. Cl. G06N 20/10 (2019.01); G06T 7/50 (2017.01); G06T 7/90 (2017.01); G06N 3/04 (2023.01); G06T 7/00 (2017.01); G06V 30/148 (2022.01); G06F 18/2411 (2023.01); G06V 10/764 (2022.01); G06V 10/82 (2022.01); G06V 20/80 (2022.01)
CPC G06N 20/10 (2019.01) [G06F 18/2411 (2023.01); G06N 3/04 (2013.01); G06T 7/0002 (2013.01); G06T 7/50 (2017.01); G06T 7/90 (2017.01); G06V 10/765 (2022.01); G06V 10/82 (2022.01); G06V 20/80 (2022.01); G06V 30/153 (2022.01)] 20 Claims
OG exemplary drawing
 
1. A method for performing automatic visual inspection, comprising:
capturing visual information of an object using a scanning system comprising a plurality of cameras;
extracting, by a computing system comprising a processor and memory, one or more feature maps from the visual information using one or more feature extractors;
classifying, by the computing system, the object by supplying the one or more feature maps to a complex classifier to compute a classification of the object, the complex classifier comprising:
a plurality of simple classifiers, each simple classifier of the plurality of simple classifiers being configured to compute outputs representing a characteristic of the object and one or more of the simple classifiers comprising a K-ary output, where K is greater than two,
a decision tree connecting each simple classifier of the plurality of simple classifiers, wherein:
the K-ary output selects particular branches of the decision tree to take, and
different branches of the decision tree are associated with different trained models, and
one or more logical operators configured to combine the outputs of the simple classifiers to compute the classification of the object; and
outputting, by the computing system, the classification of the object as a result of the automatic visual inspection.