CPC G06V 10/764 (2022.01) [G06F 18/214 (2023.01); G06F 18/23 (2023.01); G06F 18/251 (2023.01); G06F 18/29 (2023.01); G06N 5/02 (2013.01); G06V 10/82 (2022.01); G06V 20/62 (2022.01); G06V 40/16 (2022.01)] | 17 Claims |
1. A method for automatically generating a training data set for object recognition, comprising:
obtaining profile information for a plurality of objects; and
for each object from the plurality of objects:
collecting a group of initial images associated with the object based on an identity information of the object included in the profile information of the object;
filtering the group of initial images to obtain a group of filtered images associated with the object, wherein filtering the group of initial images further comprises, for each initial image:
calculating a first relevance score based on a similarity between the initial image and an image in the profile information of the object;
calculating a second relevance score based on a similarity between a description of the initial image and a description of the image in the profile information of the object;
determining that the initial image is a noisy image based on the first relevance score and the second relevance score; and
removing the initial image from the group of initial images in response to the determining that the initial image is a noisy image;
generating a group of training data pairs corresponding to the object by labeling each of the group of filtered images with the identity information of the object;
adding the group of training data pairs into the training data set; and
training an image recognition model based on the training data set, wherein the trained image recognition model is configured to perform image recognition for an input image.
|