| CPC G06V 30/19147 (2022.01) [G06V 10/82 (2022.01); G06V 30/1475 (2022.01); G06V 30/1607 (2022.01); G06V 30/19173 (2022.01)] | 20 Claims |

|
1. A method for extracting text information from images, comprising:
obtaining an extraction request associated with live data comprising an image;
generating, using a prediction model, rotational variant features and rotational invariant features associated with the live data;
generating, using the prediction model, text embeddings associated with the rotational variant features using overlapping kernel-based embedding on the live data;
generating, using the prediction model, attention values for each pixel in the live data using context attention;
applying a trained language model to the text embeddings, attention values, and the live data to generate predictions; and
performing extraction actions based on the predictions.
|