US 12,079,342 B2
Data lineage management
Anindya Misra, Arlington, VA (US); and Harshit Krishna, Arlington, VA (US)
Assigned to Capital One Services, LLC, McLean, VA (US)
Filed by Capital One Services, LLC, McLean, VA (US)
Filed on May 27, 2022, as Appl. No. 17/804,316.
Application 17/804,316 is a continuation of application No. 16/706,943, filed on Dec. 9, 2019, granted, now 11,347,855.
Application 16/706,943 is a continuation of application No. 16/453,465, filed on Jun. 26, 2019, granted, now 10,503,905, issued on Dec. 10, 2019.
Prior Publication US 2022/0284102 A1, Sep. 8, 2022
Int. Cl. G06F 21/57 (2013.01); G06F 8/65 (2018.01); G06F 16/17 (2019.01); H04L 9/00 (2022.01); H04L 9/06 (2006.01); H04L 9/32 (2006.01)
CPC G06F 21/57 (2013.01) [G06F 8/65 (2013.01); G06F 16/1734 (2019.01); H04L 9/0643 (2013.01); H04L 9/32 (2013.01); H04L 9/50 (2022.05)] 20 Claims
OG exemplary drawing
 
1. A method, comprising:
determining, by a device, a lineage record identifier associated with a dataset,
wherein a lineage record associated with the lineage record identifier is generated using a collection of software development tools that perform one or more calls to a distributed ledger, and
wherein the collection of software development tools is configured to utilize a particular protocol in determining content to be included in the lineage record;
identifying, by the device, one or more blocks associated with a blockchain that includes the lineage record identifier;
obtaining, by the device, and from the one or more blocks, update information associated with the dataset,
wherein the update information identifies a source of an operation and one or more changes made to the dataset by another device during a time period associated with the one or more blocks;
determining, by the device and based on the update information, a first state corresponding to the one or more changes made to the dataset by the other device;
determining, by the device and based on the source of the operation, a second state associated with the dataset as accessed by the device; and
performing, by the device, based on the first state, and based on the second state, one or more updates to the dataset to generate an updated dataset that corresponds to the first state.