| CPC G06V 10/771 (2022.01) [G06V 10/462 (2022.01); G06V 10/7715 (2022.01); G06V 10/80 (2022.01); G06V 10/82 (2022.01)] | 19 Claims |

|
1. A non-transitory computer-readable data storage medium storing program code executable by a processor to perform processing comprising:
generating a saliency map of an image;
identifying a plurality of saliency regions of the saliency map;
merging the saliency regions into a combined saliency region;
generating a plurality of candidate image crops of the image based on the combined saliency region; and
selecting an image crop of the image from the candidate image crops using a machine learning model, wherein the machine learning model is a neural network trained as a twin neural network based on reference images and image crops of the reference images using a ranking loss objective in which the image crops are negative samples and the reference images are positive samples.
|