US 11,756,291 B2
	Scene and user-input context aided visual search
Adam Turkelson, Washington, DC (US); Kyle Martin, Washington, DC (US); and Sethu Hareesh Kolluru, Washington, DC (US)
Assigned to Slyce Acquisition Inc., Philadelphia, PA (US)
Filed by Slyce Acquisition Inc., Washington, DC (US)
Filed on Aug. 17, 2020, as Appl. No. 16/995,531.
Application 16/995,531 is a continuation in part of application No. 16/719,697, filed on Dec. 18, 2019, abandoned.
Application 16/719,697 is a continuation in part of application No. 16/719,710, filed on Dec. 18, 2019, granted, now 10,755,128.
Application 16/719,710 is a continuation in part of application No. 16/719,716, filed on Dec. 18, 2019, granted, now 10,977,520.
Claims priority of provisional application 62/781,422, filed on Dec. 18, 2018.
Claims priority of provisional application 62/781,446, filed on Dec. 18, 2018.
Claims priority of provisional application 62/781,414, filed on Dec. 18, 2018.
Claims priority of provisional application 62/781,432, filed on Dec. 18, 2018.
Prior Publication US 2021/0004589 A1, Jan. 7, 2021
Int. Cl. G06K 9/00 (2022.01); G06V 10/82 (2022.01); G06T 5/00 (2006.01); G06V 30/19 (2022.01); G06V 10/80 (2022.01); G06F 3/01 (2006.01); G06F 3/041 (2006.01)

CPC G06V 10/82 (2022.01) [G06T 5/00 (2013.01); G06V 10/809 (2022.01); G06V 30/19173 (2022.01); G06F 3/013 (2013.01); G06F 3/017 (2013.01); G06F 3/041 (2013.01)]

20 Claims

1. A tangible, non-transitory, computer-readable medium storing computer program instructions that when executed by one or more processors effectuate operations comprising:

obtaining, with a computer system, a first training set to train a computer vision model, the first training set comprising images depicting objects and labels corresponding to object identifiers and indicating which object is depicted in respective labeled images;

training, with the computer system, the computer vision model to detect the objects in other images based on the first training set, wherein training the computer vision model comprises:

encoding depictions of objects in the first training set as vectors in a vector space of lower dimensionality than at least some images in the first training set, and

designating, based on the vectors, locations in the vector space as corresponding to object identifiers;

obtaining a first vector encoding a first depiction of a first object in a first query image;

determining similarity between the first depiction of the first object in the first query image and another object in an image of the first training set based on a first distance between the first vector and a second vector associated with the another object in the image of the first training set,

designating as a first object identifier associated with the first object in the first query image, an object identifier associated by the trained computer vision model with the another object in the image of the first training set;

determining, with the computer system, based on the first distance between the first location of the first vector in the vector space and the second location of the second vector in the vector space, to add the first image or data based on the first image to a second training set, wherein:

the second training set comprises both at least some of the images from the first training set and the first image,

the first image is absent from the first training set, and

the first image is labeled as depicting the first object in the second training set; and

training, with the computer system, the computer vision model with the second training set.