CPC G06T 11/60 (2013.01) [G10L 15/16 (2013.01); G10L 15/22 (2013.01); G10L 2015/223 (2013.01)] | 19 Claims |
8. A computer-implemented method comprising:
retrieving a source image and a natural language request;
inferring, using an operation classifier model, an image editing operation from the natural language request;
generating, using a grounding model, an image mask for an object or region of the source image that is inferred to correspond to the image editing operation;
performing, using a submodule of an operation modular network, the image editing operation on the source image, wherein the submodule is configured to infer one or more parameters used for performing the image editing operation; and
outputting a modified source image.
|
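The steps recited in claim 8 can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration only: the claim does not disclose any model architecture, so simple keyword rules stand in for the operation classifier model and the grounding model, a grayscale nested list stands in for the source image, and the names `classify_operation`, `ground_region`, `infer_amount`, `SUBMODULES`, and `edit_image` are hypothetical.

```python
from typing import Callable, Dict, List

Image = List[List[int]]   # grayscale pixel values in 0..255 (illustrative format)
Mask = List[List[bool]]   # True where the edit should be applied


def classify_operation(request: str) -> str:
    """Hypothetical stand-in for the operation classifier model (keyword lookup)."""
    text = request.lower()
    if "brighten" in text or "brighter" in text:
        return "brighten"
    if "darken" in text or "darker" in text:
        return "darken"
    raise ValueError(f"no supported operation found in: {request!r}")


def ground_region(image: Image, request: str) -> Mask:
    """Hypothetical stand-in for the grounding model.

    Selects the top half, the bottom half, or the whole image by keyword.
    """
    height = len(image)
    text = request.lower()

    def selected(row: int) -> bool:
        if "top" in text:
            return row < height // 2
        if "bottom" in text:
            return row >= height // 2
        return True

    return [[selected(r) for _ in image[r]] for r in range(height)]


def infer_amount(request: str) -> int:
    """Parameter inference used by the submodules: map wording to a pixel delta."""
    text = request.lower()
    if "slightly" in text:
        return 10
    if "a lot" in text:
        return 80
    return 40


def brighten(image: Image, mask: Mask, request: str) -> Image:
    """Submodule: infers its own amount, then raises masked pixels (clamped at 255)."""
    amount = infer_amount(request)
    return [[min(255, p + amount) if m else p for p, m in zip(row, mrow)]
            for row, mrow in zip(image, mask)]


def darken(image: Image, mask: Mask, request: str) -> Image:
    """Submodule: infers its own amount, then lowers masked pixels (clamped at 0)."""
    amount = infer_amount(request)
    return [[max(0, p - amount) if m else p for p, m in zip(row, mrow)]
            for row, mrow in zip(image, mask)]


# Toy "operation modular network": one submodule per supported operation.
SUBMODULES: Dict[str, Callable[[Image, Mask, str], Image]] = {
    "brighten": brighten,
    "darken": darken,
}


def edit_image(image: Image, request: str) -> Image:
    """Run the claimed steps in order and return the modified image."""
    operation = classify_operation(request)      # infer the editing operation
    mask = ground_region(image, request)         # generate the image mask
    return SUBMODULES[operation](image, mask, request)  # perform the operation


if __name__ == "__main__":
    source = [[100, 100], [100, 250]]
    print(edit_image(source, "Slightly brighten the bottom"))
```

In this sketch the request is passed through to the selected submodule so that the submodule itself infers the parameter (here, the brightness delta), which mirrors the "wherein" clause of the claim. A real system would presumably replace each keyword rule with a learned model.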