| CPC G06F 16/215 (2019.01) | 17 Claims |

|
1. A system for determining and maintaining data quality in data processing, the system comprising:
at least one non-transitory storage device containing instructions; and
at least one processing device coupled to the at least one non-transitory storage device, wherein the at least one processing device, upon execution of the instructions, is configured to:
render a flow designer interface including a flow portion and a plugin portion, wherein the flow portion creates a flow operation using blocks connected defining a flow, wherein the blocks are one or more plugins selected from the plugin portion;
determine, via the flow designer interface, the one or more plugins used in a flow execution of the flow portion, wherein the flow execution performs at least one transformation on one or more data sets;
determine one or more automatic data quality rules based on at least one of the one or more plugins using in the flow execution of the flow designer interface, wherein the one or more automatic data quality rules comprises a column validation, a cross column validation, and a feed level validation;
determine one or more custom data quality rules based on a custom data quality rule input received from an end-point device, wherein the custom data quality rule comprises one or more data statistics to be monitored during the flow execution;
cause execution of each of the automatic data quality rules and the one or more custom data quality rules on data in the flow execution; and
present, via the flow designer interface, connections between rendering of the one or more plugins used in the flow execution and apply the determined one or more automatic data quality rules based on the connection between the one or more plugins.
|