US 12,468,541 B2
Curating anomalous data for use in a data pipeline through interaction with a data source
Ofir Ezrielev, Beer Sheva (IL); Hanna Yehuda, Acton, MA (US); and Kristen Jeanne Walsh, Austin, TX (US)
Assigned to Dell Products L.P., Round Rock, TX (US)
Filed by Dell Products L.P., Round Rock, TX (US)
Filed on Jun. 29, 2023, as Appl. No. 18/343,930.
Prior Publication US 2025/0004783 A1, Jan. 2, 2025
Int. Cl. G06F 9/38 (2018.01)
CPC G06F 9/3867 (2013.01) [G06F 9/3895 (2013.01)]
20 Claims
 
1. A method of curating data by a data manager, the method comprising:
making an identification that data obtained from a data source associated with a data pipeline comprises an anomalous data point;
identifying a feature of a set of features associated with the anomalous data point that meets importance criteria;
obtaining a final data point based on the feature and through an interaction with the data source; and
populating the data pipeline with the final data point to provide the final data point to a downstream consumer of the data pipeline using one or more application programming interfaces (APIs) associated with the data pipeline,
wherein populating the data pipeline with the final data point comprises storing the final data point in a data repository associated with the data pipeline, and
wherein the final data point is populated into the data pipeline to prevent the anomalous data point from being provided to a data consumer and negatively impacting operations and functionalities of a data processing system that uses the data in the data pipeline in one or more processes executed by the data processing system for providing computer-implemented services to the downstream consumer.
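
By way of illustration only, the steps recited in claim 1 might be sketched as follows. The claim does not specify how an anomaly is detected, what the importance criteria are, or what form the interaction with the data source takes, so everything concrete below is an assumption made for this sketch: a per-feature z-score against historical data stands in for anomaly detection, "the feature with the largest z-score" stands in for the importance criteria, a hypothetical callback named confirm_with_source stands in for the interaction with the data source, and an in-memory list stands in for the data repository and its APIs. None of these names or choices comes from the patent.

```python
from dataclasses import dataclass, field
from statistics import mean, pstdev
from typing import Callable, Dict, List

DataPoint = Dict[str, float]


def feature_z_scores(point: DataPoint, history: List[DataPoint]) -> Dict[str, float]:
    """Score each feature by its distance from the historical mean.

    Assumption: a z-score is only one possible anomaly measure; the claim
    does not name a specific technique.
    """
    scores = {}
    for name, value in point.items():
        values = [h[name] for h in history]
        spread = pstdev(values)
        scores[name] = 0.0 if spread == 0 else abs(value - mean(values)) / spread
    return scores


@dataclass
class DataManager:
    """Hypothetical data manager; all names here are illustrative."""

    history: List[DataPoint]
    # Hypothetical stand-in for the "interaction with the data source":
    # given a feature name and the suspect value, return a confirmed value.
    confirm_with_source: Callable[[str, float], float]
    anomaly_threshold: float = 3.0
    # Stand-in for the data repository associated with the data pipeline.
    repository: List[DataPoint] = field(default_factory=list)

    def curate(self, point: DataPoint) -> DataPoint:
        scores = feature_z_scores(point, self.history)

        # Step 1: identify whether the data point is anomalous.
        if max(scores.values()) <= self.anomaly_threshold:
            self.repository.append(dict(point))
            return point

        # Step 2: pick the feature that meets the (assumed) importance
        # criterion: the one contributing most to the anomaly.
        important = max(scores, key=scores.get)

        # Step 3: obtain the final data point through the data source.
        final = dict(point)
        final[important] = self.confirm_with_source(important, point[important])

        # Step 4: populate the pipeline with the final data point only, so
        # the anomalous point never reaches a downstream consumer.
        self.repository.append(final)
        return final


if __name__ == "__main__":
    history = [
        {"temp": 20.0, "humidity": 50.0},
        {"temp": 21.0, "humidity": 52.0},
        {"temp": 19.0, "humidity": 48.0},
        {"temp": 20.0, "humidity": 50.0},
    ]
    # The lambda simulates a source that re-measures and reports 20.5.
    manager = DataManager(history, confirm_with_source=lambda name, value: 20.5)
    print(manager.curate({"temp": 200.0, "humidity": 51.0}))
```

In this sketch only the single most anomalous feature is re-confirmed with the source; a real data manager could apply different importance criteria or confirm several features, which the claim language leaves open.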