| CPC G06V 10/761 (2022.01) [G06V 10/764 (2022.01)] | 20 Claims |

|
1. A training method performed by a computing apparatus, the training method comprising:
generating an anchor image embedding vector for an anchor image using an image representation model;
determining first similarities between the anchor image and negative samples of the anchor image using first image embedding vectors for the negative samples and the generated anchor image embedding vector;
determining second similarities between the anchor image and positive samples of the anchor image using second image embedding vectors for the positive samples and the generated anchor image embedding vector;
obtaining one of a vector corresponding to a label of the anchor image and third similarities between the label of the anchor image and labels of the negative samples;
determining a loss value for the anchor image based on (i) the determined first similarities, (ii) the determined second similarities, and (iii) one of the obtained third similarities and a fourth similarity, wherein the fourth similarity is a similarity between the obtained vector and the generated anchor image embedding vector; and
updating weights of the image representation model based on the determined loss value.
|
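The loss determination recited in claim 1 can be illustrated with a minimal sketch. The claim does not fix a particular loss formula, so everything below is an assumption made for illustration only: the use of cosine similarity, the InfoNCE-style form, the temperature `tau`, the use of the third similarities as down-weighting factors `1 - label_sims` on the negatives, and the use of the fourth similarity as an additive penalty `1 - fourth`. The function names `cosine_sim` and `anchor_loss` are hypothetical, and the label vector is assumed to have the same dimension as the image embedding.

```python
import numpy as np

def cosine_sim(a, b):
    """Cosine similarity between vector a (d,) and each row of b (n, d)."""
    a = a / np.linalg.norm(a)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    return b @ a

def anchor_loss(anchor, negatives, positives,
                label_sims=None, label_vec=None, tau=0.1):
    """Hypothetical loss for one anchor image (illustrative form only).

    anchor:     (d,) anchor image embedding vector
    negatives:  (n, d) first image embedding vectors (negative samples)
    positives:  (p, d) second image embedding vectors (positive samples)
    label_sims: (n,) third similarities in [0, 1] between the anchor label
                and the negative-sample labels (assumed range)
    label_vec:  (d,) vector corresponding to the anchor label
    Exactly one of label_sims / label_vec is expected.
    """
    first = cosine_sim(anchor, negatives)    # (i) anchor vs. negatives
    second = cosine_sim(anchor, positives)   # (ii) anchor vs. positives

    if label_sims is not None:
        # (iii-a) Assumption: negatives with labels similar to the anchor
        # label contribute less to the denominator.
        weights = 1.0 - np.asarray(label_sims, dtype=float)
        penalty = 0.0
    else:
        # (iii-b) Assumption: the fourth similarity pulls the anchor
        # embedding toward its label vector through an additive penalty.
        fourth = cosine_sim(anchor, label_vec[None, :])[0]
        weights = np.ones_like(first)
        penalty = 1.0 - fourth

    neg_term = np.sum(weights * np.exp(first / tau))
    pos_terms = np.exp(second / tau)
    contrastive = -np.mean(np.log(pos_terms / (pos_terms + neg_term)))
    return contrastive + penalty

# Toy 2-D embeddings, chosen only to exercise both branches.
anchor = np.array([1.0, 0.0])
negatives = np.array([[0.0, 1.0], [-1.0, 0.0]])
positives = np.array([[1.0, 0.1]])

loss_with_label_sims = anchor_loss(anchor, negatives, positives,
                                   label_sims=np.zeros(2))
loss_with_label_vec = anchor_loss(anchor, negatives, positives,
                                  label_vec=anchor.copy())
```

In a full training loop, the final step of the claim (updating the weights of the image representation model) would typically be carried out by backpropagating this scalar through the model with an automatic-differentiation framework. That step is omitted here because the claim does not specify an optimizer or a model architecture.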