US 11,720,942 B1
	Interactive retrieval using visual semantic matching
Loris Bazzani, Berlin (DE); and Yanbei Chen, London (GB)
Assigned to AMAZON TECHNOLOGIES, INC., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Jun. 29, 2020, as Appl. No. 16/915,361.
Claims priority of provisional application 62/934,440, filed on Nov. 12, 2019.
Int. Cl. G06Q 30/06 (2023.01); G06N 3/02 (2006.01); G06F 16/535 (2019.01); G06Q 30/0601 (2023.01); G06T 7/00 (2017.01)

CPC G06Q 30/0613 (2013.01) [G06F 16/535 (2019.01); G06N 3/02 (2013.01); G06T 7/00 (2013.01)]

20 Claims

1. A method of interactive shopping assistance, said method comprising:

training a machine learning product prediction model based at least in part on:

determining, using at least one processor, a predicted output vector based on an encoding of a reference image input and an encoding of a modification text input describing a modification to the reference image input that results in a target image output;

determining, using the at least one processor, a target output vector based at least in part on an encoding of the target image output; and

determining, using the at least one processor, a compositional matching loss based at least in part on a difference between the predicted output vector and the target output vector;

receiving image data from a user, the image data representing an image of an article of clothing;

receiving a modification input from the user, the modification input describing a desired modification to the article of clothing; and

processing the image data and the modification input with the machine learning product prediction model to identify a target product corresponding to the desired modification to the article of clothing; and

sending image data of the target product to the user.