US 12,153,619 B2
Generative prompt expansion for image generation
Ritiz Tambi, San Francisco, CA (US); and Ajinkya Gorakhnath Kale, San Jose, CA (US)
Assigned to ADOBE INC., San Jose, CA (US)
Filed by ADOBE INC., San Jose, CA (US)
Filed on Sep. 20, 2022, as Appl. No. 17/933,595.
Prior Publication US 2024/0095275 A1, Mar. 21, 2024
Int. Cl. G06F 16/00 (2019.01); G06F 16/532 (2019.01)
CPC G06F 16/532 (2019.01) 19 Claims
OG exemplary drawing
 
1. A method comprising:
identifying an original query comprising an image description;
tokenizing the original query to obtain a plurality of original tokens;
generating a plurality of expanded queries by generating a plurality of additional phrases based on the original query using a causal language model (CLM), wherein each of the plurality of expanded queries comprises an expanded image description that includes the original query and an additional phrase from the plurality of additional phrases, wherein the CLM generates each of the plurality of additional phrases by generating a plurality of sequences of tokens, respectively, and wherein each token of each sequence of the plurality of sequences of tokens is generated based on the plurality of original tokens and a sequence of previously generated tokens;
inserting a mask token at a target location of each of the plurality of expanded queries;
replacing the mask token with an insertion phrase using a masked language model (MLM) to obtain a plurality of modified queries, respectively; and
providing a plurality of images in response to the original query, wherein the plurality of images are associated with the plurality of modified queries, respectively.