US 12,443,507 B2
System and method for tracing data streamed across different platforms and identifying data manipulations performed across different platforms
Akshay Pore, Phoenix, AZ (US); Piyoosh Bhargav, Phoenix, AZ (US); Anthony Mavromatis, Brooklyn, NY (US); Andrew Jennings, New York, NY (US); and Matthew Rawlings, London (GB)
Assigned to JPMORGAN CHASE BANK, N.A., New York, NY (US)
Filed by JPMorgan Chase Bank, N.A., New York, NY (US)
Filed on Feb. 22, 2024, as Appl. No. 18/584,687.
Prior Publication US 2025/0272213 A1, Aug. 28, 2025
Int. Cl. G06F 11/34 (2006.01); G06F 11/30 (2006.01); G06F 11/32 (2006.01)
CPC G06F 11/3476 (2013.01) [G06F 11/3006 (2013.01); G06F 11/323 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method for tracing data streamed across differing different system platforms, the method comprising:
providing, by each of a plurality of devices and to a database, context data corresponding to a data event published to a streaming service including a streaming pipeline;
storing, in the database, the context data provided by each of the plurality of devices;
reading, by a data classifier, the context data stored in the database and extracting a data classifier block from the stored context data;
gathering, by the data classifier and from the extracted data classifier block, personal identification information and sensitive data elements;
reading, by a lineage processor, the context data stored in the database and extracting a lineage tracer block from the stored context data;
converting, by the lineage processor, the lineage tracer block into a linked lineage triple;
processing, by a lineage processor, the linked lineage triple by tokenizing and deduplicating the linked lineage triple; and
generating, by the data trace builder, a lineage graph using the tokenized and deduplicated linked lineage triple for visualization.