US 11,720,589 B2
System and method for transformation of unstructured document tables into structured relational data tables
Joy Mustafi, Telangana (IN)
Assigned to salesforce.com, inc., San Francisco, CA (US)
Filed by salesforce.com, inc., San Francisco, CA (US)
Filed on Aug. 10, 2021, as Appl. No. 17/398,755.
Application 17/398,755 is a continuation of application No. 16/536,165, filed on Aug. 8, 2019, granted, now 11,106,668.
Prior Publication US 2021/0365450 A1, Nov. 25, 2021
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 17/30 (2006.01); G06F 16/25 (2019.01); G06F 16/2453 (2019.01); G06F 16/28 (2019.01)
CPC G06F 16/258 (2019.01) [G06F 16/24534 (2019.01); G06F 16/254 (2019.01); G06F 16/282 (2019.01); G06F 16/284 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A system for transforming an unstructured table from a textual document into a table suitable for a relational database, the system comprising:
a communication interface that receives the textual document having the unstructured table;
a memory containing machine readable medium storing machine executable code; and
one or more processors coupled to the memory and configurable to execute the machine executable code to cause the one or more processors to:
identify a tabular structure relating to the unstructured table from the textual document;
detect one or more headers based on a characteristic associated with a text in the one or more headers from the tabular structure;
create one or more new headers and corresponding new columns by expanding the one or more headers based on a header pattern of the detected one or more headers;
generate a one-dimensional table structure according to a relational database format based on the one or more new headers; and
populate the one-dimensional table structure with corresponding data values extracted from the unstructured table.