CPC G06V 10/22 (2022.01) [G06F 16/9024 (2019.01); G06K 9/6256 (2013.01); G06N 3/04 (2013.01); G06Q 10/0833 (2013.01); G06Q 10/0838 (2013.01); G06Q 20/407 (2013.01); G06V 10/457 (2022.01); G06V 10/464 (2022.01); G06V 30/10 (2022.01)] | 20 Claims |
1. A system, comprising:
a non-transitory memory; and
one or more hardware processors coupled with the non-transitory memory and configured to read instructions from the non-transitory memory to cause the system to perform operations comprising:
deriving text data from an image of a shipment label;
analyzing text characteristics of the text data in the image;
determining, from the image, a plurality of bounding boxes corresponding to a plurality of distinct text groups in the image based on the analyzing the text characteristics of the text data;
determining, for each of the plurality of bounding boxes, features comprising at least a location of a corresponding bounding box, a size of the corresponding bounding box, and a text arrangement within the corresponding bounding box;
constructing a graph comprising a plurality of nodes and a plurality of edges based on the features determined for each of the plurality of bounding boxes, wherein the constructing comprises generating a node for each bounding box in the plurality of bounding boxes and generating an edge between a pair of nodes based on locations of a corresponding pair of bounding boxes;
determining, from the plurality of bounding boxes, a particular bounding box corresponding to a tracking number of a shipment by providing attributes of the graph as inputs to a graph neural network; and
performing an action in association with a user account based on a shipment status associated with the tracking number.
|