US 12,249,169 B1
Processing multiple documents in an image
Tamanna Agrawal, Kosabadi (IN); Shivshankar Reddy, Bangalore (IN); and Himanshu Sharad Bhatt, Bangalore (IN)
Assigned to AMERICAN EXPRESS TRAVEL RELATED SERVICES COMPANY, INC., New York, NY (US)
Filed by American Express Travel Related Services Company, Inc., New York, NY (US)
Filed on Dec. 21, 2023, as Appl. No. 18/392,610.
Int. Cl. G06V 30/148 (2022.01); G06V 10/82 (2022.01); G06V 30/146 (2022.01); G06V 30/40 (2022.01)
CPC G06V 30/158 (2022.01) [G06V 10/82 (2022.01); G06V 30/1463 (2022.01); G06V 30/147 (2022.01); G06V 30/40 (2022.01)]
30 Claims
 
1. A system, comprising:
a computing device comprising a processor and a memory; and
machine-readable instructions stored in the memory that, when executed by the processor, cause the computing device to at least:
identify first text from an image;
determine a first alignment of the first text;
identify second text from the image;
determine a second alignment of the second text, wherein the first alignment differs from the second alignment;
determine a first orientation of the first text based at least in part on a calculation of an average alignment angle of the first alignment;
determine a second orientation of the second text based at least in part on a calculation of an average alignment angle of the second alignment;
identify, using an object detection model, a first document in the image based at least in part on the first orientation and a second document in the image based at least in part on the second orientation;
generate a first separate image file representing the first document; and
generate a second separate image file representing the second document.
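
The orientation steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. It assumes a hypothetical upstream OCR step that reports a baseline angle in degrees for each piece of text, and every name in it (`TextLine`, `average_angle`, `group_by_alignment`, `orientation`, the 15-degree tolerance, and the snapping to multiples of 90 degrees) is an illustrative assumption not taken from the patent. The object detection model and the generation of separate image files are left out.

```python
import math
from dataclasses import dataclass


@dataclass
class TextLine:
    text: str
    angle_deg: float  # baseline angle from a hypothetical OCR step (assumption)


def average_angle(angles_deg):
    """Circular mean of angles in degrees, so that 350 and 10 average near 0."""
    s = sum(math.sin(math.radians(a)) for a in angles_deg)
    c = sum(math.cos(math.radians(a)) for a in angles_deg)
    return math.degrees(math.atan2(s, c))


def angular_distance(a, b):
    """Smallest absolute difference between two angles, in degrees."""
    return abs((a - b + 180.0) % 360.0 - 180.0)


def group_by_alignment(lines, tolerance_deg=15.0):
    """Greedily group text whose angle is close to a group's average angle.

    The tolerance is an illustrative assumption, not a value from the claim.
    """
    groups = []
    for line in lines:
        for group in groups:
            mean = average_angle([item.angle_deg for item in group])
            if angular_distance(line.angle_deg, mean) <= tolerance_deg:
                group.append(line)
                break
        else:
            # No existing group matched, so this text starts a new alignment.
            groups.append([line])
    return groups


def orientation(lines):
    """Snap a group's average alignment angle to 0, 90, 180, or 270 degrees."""
    mean = average_angle([item.angle_deg for item in lines])
    return round(mean / 90.0) * 90 % 360


lines = [
    TextLine("Invoice", 2.0),
    TextLine("Total", -1.0),
    TextLine("Receipt", 91.0),
    TextLine("Amount", 88.0),
]
groups = group_by_alignment(lines)
orientations = [orientation(group) for group in groups]
```

In this sketch, text with two distinct alignments falls into two groups, and each group's orientation is derived from its average alignment angle. A downstream detector could then use those orientations when locating each document, as the claim recites.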