CPC G06T 7/194 (2017.01) [G06N 3/08 (2013.01); G06T 7/11 (2017.01)] | 19 Claims |
1. A computerized system, the system comprising:
one or more processors; and
computer storage memory having computer-executable instructions stored thereon which, when executed by the one or more processors, implement a method comprising:
receiving an input image;
deriving, via at least a first model, a first mask and a second mask, the first mask indicates a set of objects in the input image belonging to a first object class, the second mask defines each instance of the set of objects;
generating a feature map by concatenating one or more features from at least: the input image, the first mask, and the second mask;
based on the feature map, generating, via at least a second model, a third mask, the third mask indicates which pixels of the input image correspond to a foreground of the input image, the foreground excludes pixels corresponding to a background of the input image; and
based on the generating of the third mask, causing presentation of an output image associated with the input image, wherein the output image includes a fourth mask that defines a second instance of the set of objects, the second instance not being defined in the second mask.
|