| CPC G06V 30/153 (2022.01) [G06V 10/7747 (2022.01); G06V 10/82 (2022.01); G06V 30/1823 (2022.01)] | 20 Claims |

|
1. A computer program product for performing optical character recognition processing of an image, the computer program product comprising a computer readable storage medium having computer readable program code embodied therein that is executable to perform operations, the operations comprising:
providing a vector encoder trained to encode images, comprising digital images representing text, into vectors in a vector space, wherein vectors of images representing similar text have a high degree of cohesion in the vector space, and wherein vectors of images representing dissimilar text have a low degree of cohesion in the vector space;
processing an input image to determine sub-images, of the input image, wherein the sub-images bound the text represented in the input image;
inputting the sub-images to the vector encoder to output sub-image vectors, wherein the sub-image vectors represent the sub-images in the vector space;
using the vector encoder to generate a search vector for search text, wherein the search vector represents the search text in the vector space;
determining the sub-image vectors that match the search vector; and
applying optical character recognition to at least one region of the input image including the sub-images having the sub-image vectors matching the search vector based on closeness of the sub-image vectors and the search vector in the vector space.
|