| CPC G06V 20/70 (2022.01) [G06V 10/26 (2022.01); G06V 10/761 (2022.01)] | 20 Claims |

|
1. A computer-implemented method, comprising:
receiving first image data representing a first image;
generating second image data representing a portion of the first image that is generated by cropping the first image data according to a first aspect ratio of a first display of a target device;
generating, by inputting the first image data into an image captioning model, first text data representing a first description of first content of the first image data;
generating, using a first encoder, a first vector representation of the first text data;
generating, by inputting the second image data into the image captioning model, second text data representing a second description of second content of the second image data;
generating, using the first encoder, a second vector representation of the second text data;
determining a first cosine similarity score between the first vector representation and the second vector representation; and
generating, based at least in part on the first cosine similarity score, first computer-executable instructions to cause the second image data to be displayed on the first display of the target device.
|