| CPC G06F 16/254 (2019.01) [G06F 16/2282 (2019.01); G06F 16/26 (2019.01); G06N 5/04 (2013.01); G06N 20/00 (2019.01)] | 20 Claims |

|
1. A system for monitoring an extract, transform, load (ETL) pipeline, comprising:
one or more memories; and
one or more processors, coupled to the one or more memories, configured to:
receive configuration information associated with the ETL pipeline that includes one or more data sources and one or more data sinks;
execute one or more test cases within the ETL pipeline to generate one or more metrics associated with a quality or reliability of data in the ETL pipeline, the one or more test cases each being executed based on extracting respective test case data from a respective source table, transforming the respective test case data in the ETL pipeline, and loading the respective transformed test case data into a respective target table;
generate one or more predicted quality metrics associated with the ETL pipeline using a machine learning model and using the one or more metrics,
wherein the machine learning model is trained using historical execution data associated with one or more ETL jobs; and
generate a visualization that indicates data flow from the one or more data sources to the one or more data sinks and that indicates the one or more predicted quality metrics.
|