US 11,835,994 B2
Publishing to a data warehouse
Andrew Blom, Lexington, MA (US); Darren Miller, Lexington, MA (US); and Marshall A. Isman, Newton, MA (US)
Assigned to Ab Initio Technology LLC, Lexington, MA (US)
Filed by Ab Initio Technology LLC, Lexington, MA (US)
Filed on Jul. 19, 2019, as Appl. No. 16/517,320.
Claims priority of provisional application 62/700,684, filed on Jul. 19, 2018.
Prior Publication US 2020/0026711 A1, Jan. 23, 2020
Int. Cl. G06F 7/00 (2006.01); G06F 17/00 (2019.01); G06F 16/25 (2019.01); G06F 16/901 (2019.01); G06F 8/34 (2018.01); H04L 67/565 (2022.01)
CPC G06F 16/254 (2019.01) [G06F 8/34 (2013.01); G06F 16/258 (2019.01); G06F 16/9024 (2019.01); H04L 67/565 (2022.05)] 34 Claims
 
1. A method for automatically generating an executable application to transform and load data into a structured dataset for storing data from one or more networked data sources, comprising:
receiving at least one metadata file that specifies values for parameters and ruleset mappings specifying logical rules for transforming data feeds, wherein each data feed is to be received from a networked data source, and loading results of transforming into a structured dataset;
validating that the values of the parameters and the logical rules for transforming the data feeds are not inconsistent for each data feed;
generating, based on the validating, data rules that specify one or more standards in accordance with the validated values of the parameters and the validated logical rules for transforming each data feed into a transformed record for loading into the structured dataset; and
generating, based on the generated data rules that specify the one or more standards, an executable data processing application for a runtime environment, the executable data processing application configurable to:
receive source data comprising a data feed from one or more data sources; and
transform the source data into transformed data that satisfies the one or more standards of the structured dataset in compliance with the generated data rules,
wherein the source data includes a data record, and wherein a parameter of the metadata file specifies a field of the data record that represents a key value for the data record of the source data, and
wherein a parameter of the metadata file specifies a mapping between the field that represents the key value of the source data and a field of the structured dataset that represents a key value of the structured dataset.
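
For illustration only, the flow recited in claim 1 (receive metadata, validate it, generate data rules, generate an application that transforms a feed) might be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the metadata layout and every name in it (FeedMetadata, validate, generate_rules, build_application, and the sample field names) are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Record = Dict[str, object]


@dataclass
class FeedMetadata:
    """Hypothetical metadata for one data feed (the layout is an assumption)."""
    feed_name: str
    source_fields: List[str]        # fields present in each source record
    source_key: str                 # source field that holds the key value
    target_key: str                 # key field of the structured dataset
    field_mappings: Dict[str, str]  # ruleset mapping: target field -> source field


def validate(meta: FeedMetadata) -> List[str]:
    """Return the inconsistencies found; an empty list means the metadata is consistent."""
    errors = []
    if meta.source_key not in meta.source_fields:
        errors.append(f"{meta.feed_name}: key field '{meta.source_key}' is not a source field")
    if meta.field_mappings.get(meta.target_key) != meta.source_key:
        errors.append(f"{meta.feed_name}: target key '{meta.target_key}' is not mapped to the source key")
    for target, source in meta.field_mappings.items():
        if source not in meta.source_fields:
            errors.append(f"{meta.feed_name}: '{target}' maps to unknown source field '{source}'")
    return errors


def generate_rules(meta: FeedMetadata) -> Dict[str, str]:
    """Produce data rules, but only from metadata that passed validation."""
    errors = validate(meta)
    if errors:
        raise ValueError("; ".join(errors))
    return dict(meta.field_mappings)


def build_application(rules: Dict[str, str]) -> Callable[[List[Record]], List[Record]]:
    """Return a callable that stands in for the generated executable application."""
    def run(feed: List[Record]) -> List[Record]:
        # Each transformed record contains exactly the target fields named by the rules.
        return [{target: record[source] for target, source in rules.items()} for record in feed]
    return run


# Sample metadata: the source key 'cust_id' maps to the dataset key 'customer_key'.
meta = FeedMetadata(
    feed_name="customers",
    source_fields=["cust_id", "name"],
    source_key="cust_id",
    target_key="customer_key",
    field_mappings={"customer_key": "cust_id", "customer_name": "name"},
)
app = build_application(generate_rules(meta))
loaded = app([{"cust_id": 7, "name": "Ada"}])
```

In this sketch the key-field parameter and the key mapping of the final two wherein clauses correspond to source_key, target_key, and the matching entry in field_mappings. Rules are generated only after validation succeeds, mirroring the "generating, based on the validating" step.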