| CPC G06F 21/6245 (2013.01) [G06F 40/20 (2020.01); G06T 11/00 (2013.01); G06V 10/25 (2022.01); G06V 10/759 (2022.01); G06V 10/761 (2022.01); G06V 10/774 (2022.01); G06V 40/172 (2022.01); G06V 2201/10 (2022.01)] | 15 Claims |

|
1. An information processing apparatus, comprising:
at least one processor configured to:
estimate a plurality of candidate regions of object detection from a first image;
estimate a topic of the first image based on text information, wherein the text information accompanies the first image;
evaluate the plurality of candidate regions based on relationships with the topic;
determine, based on a first object detector that relates to the topic, a candidate region of the plurality of candidate regions as a region of interest, and candidate regions of the plurality of candidate regions, other than the candidate region, as regions of non-interest, wherein
the candidate region has a specific relationship with the topic, and
the candidate region is a region in which an object that relates to the topic is detected;
detect, in a case where the topic is an object name, objects corresponding to the topic from the plurality of candidate regions based on a second object detector that relates to the object name;
collect, in a case where the first object detector is not prepared in advance, an image group having tag information that relates to the topic;
detect the objects based on a third object detector, wherein a learning of the third object detector is based on the image group; and
generate a second image based on the detection of the objects.
|