US 11,670,023 B2
Artificial intelligence techniques for performing image editing operations inferred from natural language requests
Ning Xu, Milpitas, CA (US); Trung Bui, San Jose, CA (US); Jing Shi, Rochester, NY (US); and Franck Dernoncourt, Sunnyvale, CA (US)
Assigned to Adobe Inc., San Jose, CA (US)
Filed by Adobe Inc., San Jose, CA (US)
Filed on Aug. 31, 2020, as Appl. No. 17/7,693.
Prior Publication US 2022/0067992 A1, Mar. 3, 2022
Int. Cl. G06T 7/00 (2017.01); G06T 19/20 (2011.01); G06T 11/60 (2006.01); G10L 15/16 (2006.01); G10L 15/22 (2006.01)
CPC G06T 11/60 (2013.01) [G10L 15/16 (2013.01); G10L 15/22 (2013.01); G10L 2015/223 (2013.01)] 19 Claims
OG exemplary drawing
 
8. A computer-implemented method comprising:
retrieving a source image and a natural language request;
inferring, using an operation classifier model, an image editing operation from the natural language request;
generating, using a grounding model, an image mask for an object or region of the source image that is inferred to correspond to the image editing operation;
performing, using a submodule of an operation modular network, the image editing operation on the source image, wherein the submodule is configured to infer one or more parameters used for performing the image editing operation; and
outputting a modified source image.