US 12,230,046 B2
Searchable table extraction
Shivam Agrahari, Bansi (IN); Dakshayani Singaraju, Tirupati (IN); and Veresh Jain, Bangalore (IN)
Assigned to Oracle Financial Services Software Limited, Maharashtra (IN)
Filed by Oracle Financial Services Software Limited, Mumbai (IN)
Filed on Oct. 24, 2023, as Appl. No. 18/492,910.
Application 18/492,910 is a continuation of application No. 18/174,076, filed on Feb. 24, 2023, granted, now 11,837,004.
Prior Publication US 2024/0290123 A1, Aug. 29, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G06V 30/00 (2022.01); G06F 16/835 (2019.01); G06F 40/103 (2020.01); G06F 40/143 (2020.01); G06V 10/82 (2022.01); G06V 30/16 (2022.01); G06V 30/19 (2022.01); G06V 30/412 (2022.01); G06V 30/413 (2022.01)
CPC G06V 30/412 (2022.01) [G06F 16/8373 (2019.01); G06F 40/103 (2020.01); G06F 40/143 (2020.01); G06V 10/82 (2022.01); G06V 30/16 (2022.01); G06V 30/19147 (2022.01); G06V 30/413 (2022.01)] 17 Claims
OG exemplary drawing
 
1. A computer-implemented method comprising:
extracting, by a table extraction model from an input document image, a plurality of tables as a plurality of objects, the plurality of tables including a plurality of cells, respectively;
outputting, by the table extraction model, the plurality of tables in an extensible Markup Language (XML) format and outputting cell level information of the plurality of cells;
filtering the plurality of tables based on a plurality of keywords and a condition comprising one from among AND and OR, to reduce a number of the plurality of tables, the filtering further comprising:
for each of the plurality of keywords, traversing each of the plurality of tables along a row direction and a column direction using one or more sub-keywords, each of the one or more sub-keywords being composed of one or more words of the keyword,
based on the traversing:
for each of the plurality of tables, determining whether a table comprises a first keyword from the plurality of keywords and determining whether the table comprises a second keyword from the plurality of keywords,
for the condition being AND, based on the table containing the first keyword and the second keyword, outputting the table, as a filtered table among filtered tables, and, based on the table not containing the first keyword and the second keyword, not outputting the table, and
for the condition being OR, based on the table containing the first keyword or the second keyword, outputting the table, as a filtered table among the filtered tables;
outputting the filtered tables that include the at least one from among the first keyword and the second keyword based on the condition, among the plurality of tables;
executing a query on the filtered tables, the query including one or more keys composed based on the cell level information associated with the filtered tables;
identifying one or more cells among the plurality of cells based on the one or more keys; and
providing target content that corresponds to the one or more cells, wherein the target content is a response to the query.