| CPC G06V 30/412 (2022.01) [G06V 30/19013 (2022.01); G06V 30/414 (2022.01); G06V 30/418 (2022.01)] | 18 Claims |

|
1. An apparatus, comprising:
a processor that, when executing instructions stored in a memory, is configured to:
generate a plurality of bounding boxes at a plurality of content areas in an image, wherein the plurality of bounding boxes correspond to a plurality of pieces of text within the image;
convert the plurality of bounding boxes into a plurality of bounding box vectors based on attributes of the plurality of bounding boxes;
select a bounding box of the plurality of bounding boxes as an anchor bounding box;
identify bounding boxes that are adjacent to the anchor bounding box;
concatenate bounding box vectors corresponding to the bounding boxes, that are adjacent to the anchor bounding box, into a concatenated bounding box vector;
store a mapping between the anchor bounding box and the concatenated bounding box vector;
train a machine learning model to create a trained machine learning model to transform a bounding box into a location in vector space based on mappings, including the concatenated bounding box vector; and
store the machine learning model in the memory.
|