CPC G06F 16/2425 (2019.01) [G06F 9/54 (2013.01); G06F 16/212 (2019.01); G06F 16/2282 (2019.01); G06F 16/2456 (2019.01); G06F 16/90324 (2019.01)] | 20 Claims |
1. A computer-implemented method comprising:
receiving a first instructions file comprising first instructions to generate content, when executed by one or more processors and based on content of a first table and a second table from one or more source databases, as one or more columns of a third table of a target database, wherein the first instructions comprise a join instructional statement and a deduplicate instructional statement, and wherein execution, via the one or more processors, of the first instructions of the first instructions file is associated with a first computing time interval;
parsing, via the one or more processors, the first instructions file to identify a plurality of instructional statements, wherein each instructional statement comprises at least one respective operation each configured to operate on the content of one or more of the first and second tables, and relationships between the plurality of the instructional statements;
determining, via the one or more processors and for the join instructional statement, of the plurality of instructional statements, that joins the first table and the second table, a type of relationship between the first table and the second table;
retrieving, via the one or more processors, a collection of known patterns and corresponding recommended replacement patterns, wherein a given known pattern comprises:
an identification of at least two instructional statements, and
an organizational structure between the identified at least two instructional statements;
identifying, via the one or more processors and based on the determined type of relationship between the first table and the second table and based on the identified plurality of instructional statements and the relationships between the plurality of instructional statements, one or more patterns in the relationships between the plurality of instructional statements, in the first instructions file, that correspond to at least one of the known patterns;
generating, via the one or more processors and based on the identified one or more patterns, a first results file comprising:
the identified one or more patterns, and
one or more suggested modifications to the identified one or more patterns based on one or more corresponding recommended replacement patterns;
receiving, via the one or more processors and based on the generating the first results file, a selection of the one or more suggested modifications;
storing, based on the receiving the selection, a second results file comprising the selection of the one or more suggested modifications; and
generating, via the one or more processors and based on the first results file and revised selections of the one or more suggested modifications, the one or more columns of the third table,
wherein the generating the one or more columns of the third table is associated with a second computing time interval that is shorter than the first computing time interval.
|