US 12,260,662 B2
Inferring structure information from table images
J Brandon Smock, Seattle, WA (US); Pramod Kumar Sharma, Seattle, WA (US); Natalia Larios Delgado, Kirkland, WA (US); Rohith Venkata Pesala, Frisco, TX (US); and Robin Abraham, Redmond, WA (US)
Assigned to Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on Jun. 21, 2021, as Appl. No. 17/353,563.
Claims priority of provisional application 63/175,446, filed on Apr. 15, 2021.
Prior Publication US 2022/0335240 A1, Oct. 20, 2022
Int. Cl. G06F 17/00 (2019.01); G06F 18/214 (2023.01); G06F 40/103 (2020.01); G06N 3/045 (2023.01); G06N 3/088 (2023.01); G06V 30/412 (2022.01); G06V 30/414 (2022.01)
CPC G06V 30/412 (2022.01) [G06F 18/214 (2023.01); G06F 40/103 (2020.01); G06N 3/045 (2023.01); G06N 3/088 (2013.01); G06V 30/414 (2022.01)]
19 Claims
 
1. A computer implemented method comprising:
detecting a table within a document image;
detecting table objects within the table via a table structure recognition and interpretation model that models table structure and interpretation as a set of overlapping bounding boxes within an image;
transforming the table objects into a structured table representation in part by interpreting overlapping table objects as a hierarchical relationship between table objects;
extracting data from the table objects into the structured table representation; and
exporting the structured table representation and its data into a final output format.
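
The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names ("row", "column", "column_header"), the 0.5 overlap threshold, the helper names, and the choice of CSV as the output format are all assumptions made for illustration. The sketch assumes a detector has already returned labeled bounding boxes and that words with their own boxes are available in reading order. It treats a row that lies inside a header box as a header row, forms each cell as the intersection of a row box and a column box, and assigns a word to the cell that contains most of its area.

```python
import csv
import io
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    # Labels are hypothetical class names, not taken from the patent.
    label: str
    x0: float
    y0: float
    x1: float
    y1: float


def intersect(a, b):
    """Return the overlapping rectangle (x0, y0, x1, y1) or None if there is no overlap."""
    x0, y0 = max(a.x0, b.x0), max(a.y0, b.y0)
    x1, y1 = min(a.x1, b.x1), min(a.y1, b.y1)
    if x0 >= x1 or y0 >= y1:
        return None
    return (x0, y0, x1, y1)


def overlap_fraction(inner, outer):
    """Fraction of inner's area that lies inside outer."""
    rect = intersect(inner, outer)
    if rect is None:
        return 0.0
    inner_area = (inner.x1 - inner.x0) * (inner.y1 - inner.y0)
    return (rect[2] - rect[0]) * (rect[3] - rect[1]) / inner_area


def build_grid(objects, words, threshold=0.5):
    """Turn overlapping boxes into rows of cell text.

    objects: detected Box instances; words: (text, Box) pairs in reading order.
    The threshold is an assumed value.
    """
    rows = sorted((o for o in objects if o.label == "row"), key=lambda b: b.y0)
    cols = sorted((o for o in objects if o.label == "column"), key=lambda b: b.x0)
    headers = [o for o in objects if o.label == "column_header"]
    grid = []
    for row in rows:
        # A row mostly covered by a header box is read as a child of that header.
        is_header = any(overlap_fraction(row, h) >= threshold for h in headers)
        cells = []
        for col in cols:
            rect = intersect(row, col)
            cell = Box("cell", *rect) if rect else None
            text = " ".join(
                t for t, wb in words
                if cell is not None and overlap_fraction(wb, cell) >= threshold
            )
            cells.append(text)
        grid.append({"is_header": is_header, "cells": cells})
    return grid


def to_csv(grid):
    """Export the grid as CSV text (one possible final output format)."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    for row in grid:
        writer.writerow(row["cells"])
    return buf.getvalue()
```

For a two-row, two-column table whose first row lies inside a header box, build_grid marks the first row as a header and to_csv emits one line per row. Spanning cells and row headers, which would also be expressed as overlapping boxes, are left out to keep the sketch short.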