CPC G06N 3/08 (2013.01) [G06F 18/214 (2023.01); G06N 3/04 (2013.01); G06V 10/454 (2022.01); G06V 10/764 (2022.01); G06V 10/82 (2022.01); G06V 20/10 (2022.01); G06V 20/20 (2022.01); G06V 20/64 (2022.01)] | 20 Claims |
1. In a digital medium environment for detecting objects in digital images, a method implemented by a computing device, the method comprising:
obtaining an input digital image that depicts a scene with image objects;
detecting a particular image object depicted in the input digital image with an object detection network that is untrained to detect the particular image object, the object detection network trained by concept conditioning based on an object data set of identified object classes and based on word embedding concepts, the object detection network configured to generalize to untrained object classes by the concept conditioning;
receiving an object search concept as a word input from which a word embedding is generated, the object search concept related to the particular image object depicted in the input digital image, the word embedding indicating a relationship between the object search concept and different word-based concepts; and
generating an output digital image from the input digital image and the word embedding, the output digital image depicting the scene and including one or more indications of object detection results that denote regions of the scene, including at least an indication of the particular image object corresponding to the object search concept.
|