US 12,300,007 B1
Automatic image cropping
Ilya Levner, Woodinville, WA (US); Aditya Ghuge, Seattle, WA (US); and Tabrez Mohammed, Sammamish, WA (US)
Assigned to AMAZON TECHNOLOGIES, INC., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Sep. 30, 2022, as Appl. No. 17/957,360.
Int. Cl. G06V 20/70 (2022.01); G06V 10/26 (2022.01); G06V 10/74 (2022.01); G06V 20/50 (2022.01)
CPC G06V 20/70 (2022.01) [G06V 10/26 (2022.01); G06V 10/761 (2022.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method, comprising:
receiving first image data representing a first image;
generating second image data representing a portion of the first image that is generated by cropping the first image data according to a first aspect ratio of a first display of a target device;
generating, by inputting the first image data into an image captioning model, first text data representing a first description of first content of the first image data;
generating, using a first encoder, a first vector representation of the first text data;
generating, by inputting the second image data into the image captioning model, second text data representing a second description of second content of the second image data;
generating, using the first encoder, a second vector representation of the second text data;
determining a first cosine similarity score between the first vector representation and the second vector representation; and
generating, based at least in part on the first cosine similarity score, first computer-executable instructions to cause the second image data to be displayed on the first display of the target device.