US 11,720,596 B2
Identifying content and structure of OLAP dimensions from a spreadsheet
Andrew Thomas Nelmes, East Sheen (GB); Jonathan Co, York (GB); and Alexandros Komninos, York (GB)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by INTERNATIONAL BUSINESS MACHINES CORPORATION, Armonk, NY (US); and University of York, York (GB)
Filed on Feb. 21, 2020, as Appl. No. 16/797,007.
Claims priority of application No. 1916803 (GB), filed on Nov. 19, 2019.
Prior Publication US 2021/0149919 A1, May 20, 2021
Int. Cl. G06F 16/28 (2019.01); G06F 40/30 (2020.01); G06F 40/18 (2020.01); G06F 16/22 (2019.01)
CPC G06F 16/283 (2019.01) [G06F 16/221 (2019.01); G06F 16/282 (2019.01); G06F 16/285 (2019.01); G06F 40/18 (2020.01); G06F 40/30 (2020.01)] 18 Claims
 
1. A computer-implemented method for identifying content and structure of OLAP dimensions from a spreadsheet, the method comprising:
identifying one or more tables within a spreadsheet, the tables comprising numerical data within the spreadsheet in tabular form, wherein the tables are a column-based type or a crosstab type;
determining labels from the identified one or more tables within the spreadsheet, the labels determined from the identified one or more tables using natural language processing, the labels uniquely identifying the numerical data within the one or more tables; and
determining a set of OLAP dimensions for the spreadsheet based on the one or more tables and the determined labels within the spreadsheet.
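The three steps of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. It assumes the spreadsheet has already been reduced to a single table held as a list of rows with one header row. The names `classify_table`, `normalize_label` and `olap_dimensions` are hypothetical, the empty top-left corner test for crosstabs is an assumed heuristic, and the whitespace and case normalization is only a trivial stand-in for the natural language processing recited in the claim.

```python
from typing import Any

Grid = list[list[Any]]


def is_number(cell: Any) -> bool:
    """True for int/float cells; bool is excluded because it subclasses int."""
    return isinstance(cell, (int, float)) and not isinstance(cell, bool)


def normalize_label(text: Any) -> str:
    """Stand-in for NLP: collapse whitespace and lowercase (assumption)."""
    return " ".join(str(text).split()).lower()


def classify_table(grid: Grid) -> str:
    """Return 'crosstab' or 'column' for a table with one header row.

    Assumed heuristic: a crosstab has an empty top-left corner, text labels
    in the first column, and numbers in every other body cell.
    """
    body = grid[1:]
    corner_empty = grid[0][0] in (None, "")
    first_col_text = all(isinstance(row[0], str) for row in body)
    rest_numeric = all(is_number(c) for row in body for c in row[1:])
    if corner_empty and first_col_text and rest_numeric:
        return "crosstab"
    return "column"


def olap_dimensions(grid: Grid) -> dict[str, list[str]]:
    """Derive candidate dimensions and their members from one table."""
    header, body = grid[0], grid[1:]
    if classify_table(grid) == "crosstab":
        # Row labels and column labels each form one candidate dimension.
        return {
            "rows": [normalize_label(row[0]) for row in body],
            "columns": [normalize_label(h) for h in header[1:]],
        }
    dims: dict[str, list[str]] = {}
    for j, name in enumerate(header):
        column = [row[j] for row in body]
        # Text columns become dimensions; numeric columns are treated as measures.
        if all(isinstance(c, str) for c in column):
            members = (normalize_label(c) for c in column)
            dims[normalize_label(name)] = list(dict.fromkeys(members))
    return dims


crosstab = [["", "Q1", "Q2"], ["North", 10, 12], ["South", 7, 9]]
column_based = [
    ["Region", "Quarter", "Sales"],
    ["North", "Q1", 10],
    ["North", "Q2", 12],
    ["South", "Q1", 7],
]
print(olap_dimensions(crosstab))
print(olap_dimensions(column_based))
```

In this sketch both layouts yield the same two candidate dimensions (regions and quarters), which is the point of distinguishing the column-based type from the crosstab type before deriving dimensions.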