| CPC G06F 16/211 (2019.01) [G06F 16/242 (2019.01); G06F 16/2433 (2019.01); G06F 40/226 (2020.01); G06F 40/284 (2020.01)] | 20 Claims |

|
1. A method for automated data dictionary generation and validation, the method comprising:
receiving, by communications circuitry, a first data dictionary;
parsing, by data dictionary generation circuitry, a database query;
identifying, by the data dictionary generation circuitry and based on the parsing of the database query, one or more reference term tokens;
automatically generating, by the data dictionary generation circuitry, a second data dictionary based on the one or more reference term tokens, wherein automatically generating the second data dictionary comprises:
identifying, by the data dictionary generation circuitry, that a reference term token of the one or more reference term tokens in the database query does not include a data source name, and
storing, by the data dictionary generation circuitry, an indication of a missing data source name in a data source-referencing field of the second data dictionary;
validating, by data dictionary validation circuitry, the first data dictionary based on the second data dictionary by:
identifying, by the data dictionary validation circuitry, a set of reference term tokens included in both the first data dictionary and the second data dictionary, and
removing, by the data dictionary validation circuitry, the set of reference term tokens from the second data dictionary; and
causing, by the data dictionary validation circuitry, presentation of a validation status of the first data dictionary, wherein the validation status indicates an unsuccessful validation of the first data dictionary in an instance in which any of the one or more reference term tokens remain in the second data dictionary after removing the set of reference term tokens from the second data dictionary, and wherein the validation status indicates a successful validation of the first data dictionary in an instance in which no reference term tokens remain in the second data dictionary after removing the set of reference term tokens from the second data dictionary.
|