US 11,908,053 B2
Method, non-transitory computer-readable storage medium, and apparatus for searching an image database
Sandra Mau, Pittsburgh, PA (US); Joshua Song, Capalaba (AU); Sabesan Sivapalan, Heathwood (AU); and Sreeja Krishnan, Pittsburgh, PA (US)
Assigned to CAMELOT UK BIDCO LIMITED, London (GB)
Filed by CAMELOT UK BIDCO LIMITED, London (GB)
Filed on May 28, 2021, as Appl. No. 17/333,707.
Claims priority of provisional application 63/032,432, filed on May 29, 2020.
Prior Publication US 2021/0374460 A1, Dec. 2, 2021
Int. Cl. G06T 7/10 (2017.01); G06T 11/60 (2006.01); G06V 30/146 (2022.01); G06V 10/82 (2022.01); G06V 10/44 (2022.01); G06V 10/26 (2022.01)
CPC G06T 11/60 (2013.01) [G06T 7/10 (2017.01); G06V 10/26 (2022.01); G06V 10/454 (2022.01); G06V 10/82 (2022.01); G06V 30/147 (2022.01); G06T 2207/20084 (2013.01); G06T 2210/12 (2013.01)] 14 Claims
OG exemplary drawing
 
1. A method for searching an image database, comprising:
receiving, by processing circuitry, an adulterated image of an object, the adulterated image including object annotations for visual reference;
applying, by the processing circuitry, a first neural network to the adulterated image;
correlating, by the processing circuitry, a result of the applying the first neural network with each image of a reference database of images, the result including an edited image of the object and each image of the reference database of images including a reference object; and
selecting, by the processing circuitry and as a matching image, one or more images of the reference database of images having correlation values above a threshold correlation value, wherein the method further comprises
applying, by the processing circuitry, a masking process to the adulterated image, including
performing, by the processing circuitry and via a second neural network, object recognition on the adulterated image to recognize text-based descriptive features,
applying, by the processing circuitry and based on the performing the object recognition, computer vision to detect callout features relative to bounding boxes containing the recognized text-based descriptive features, and
generating, by the process circuitry and based on the bounding boxes and the detected callout features, a contour mask of the object.