CPC G06F 16/583 (2019.01) [G06F 16/5866 (2019.01); G06F 18/22 (2023.01)] | 20 Claims |
19. A computer-implemented method, comprising:
generating, according to user behavior data associated with a specified object category and object description information of the specified object category in a manufacturing industry, a style description text for the specified object category, wherein the style description text reflects a style requirement of the specified object category;
inputting the style description text into a text prediction-based first image generation model for image generation, to obtain a plurality of initial object images, wherein the first image generation model comprises an encoder-decoder structure implemented based on a vector quantization generative adversarial network (VQGAN) and a sparse attention mechanism;
inputting the plurality of initial object images and the style description text into a second image-text matching model for matching, to obtain at least one candidate object image of which a matching degree meets a threshold requirement; and
displaying the at least one candidate object image to an evaluation system, obtaining a selected target object image in response to selection of the evaluation system, and using the target object image for a subsequent manufacturing link.
|
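The matching step of claim 19 (keeping the initial object images whose matching degree with the style description text meets a threshold requirement) can be illustrated with a minimal sketch. The claim does not specify how the matching degree is computed; the sketch below assumes, purely for illustration, that images and text have already been mapped to embedding vectors and that the matching degree is their cosine similarity. The names `cosine_similarity` and `select_candidates`, the embedding inputs, and the threshold value are hypothetical and do not come from the claim.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (assumed matching degree)."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero vector matches nothing
    return dot / (norm_a * norm_b)


def select_candidates(
    image_embeddings: Dict[str, Sequence[float]],
    text_embedding: Sequence[float],
    threshold: float,
) -> List[Tuple[str, float]]:
    """Return (image_id, score) pairs whose score meets the threshold,
    ordered from best to worst match."""
    scored = [
        (image_id, cosine_similarity(embedding, text_embedding))
        for image_id, embedding in image_embeddings.items()
    ]
    kept = [(image_id, score) for image_id, score in scored if score >= threshold]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return kept


# Hypothetical two-dimensional embeddings, for illustration only.
style_text = [1.0, 0.0]
initial_images = {
    "img_a": [1.0, 0.0],
    "img_b": [0.0, 1.0],
    "img_c": [1.0, 1.0],
}
candidates = select_candidates(initial_images, style_text, threshold=0.7)
```

In this toy input, `img_b` is orthogonal to the style text and is filtered out, while `img_a` and `img_c` are kept as candidate object images. A real system would obtain the embeddings from the image-text matching model rather than from hand-written vectors.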