| CPC G06F 11/3476 (2013.01) [G06F 11/3006 (2013.01); G06F 11/323 (2013.01)] | 20 Claims |

|
1. A method for tracing data streamed across different system platforms, the method comprising:
providing, by each of a plurality of devices and to a database, context data corresponding to a data event published to a streaming service including a streaming pipeline;
storing, in the database, the context data provided by each of the plurality of devices;
reading, by a data classifier, the context data stored in the database and extracting a data classifier block from the stored context data;
gathering, by the data classifier and from the extracted data classifier block, personal identification information and sensitive data elements;
reading, by a lineage processor, the context data stored in the database and extracting a lineage tracer block from the stored context data;
converting, by the lineage processor, the lineage tracer block into a linked lineage triple;
processing, by the lineage processor, the linked lineage triple by tokenizing and deduplicating the linked lineage triple; and
generating, by a data trace builder, a lineage graph using the tokenized and deduplicated linked lineage triple for visualization.
|
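
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is only one possible reading of the claim and not the patented implementation. Every name in it is hypothetical: the in-memory `database` list, the record fields `data_classifier` and `lineage_tracer`, the `(source, relation, target)` triple layout, and the helper functions. "Tokenizing" is assumed here to mean normalizing each element of a triple to a canonical lowercase token, and the lineage graph is assumed to be a plain adjacency mapping.

```python
from collections import defaultdict

# Stand-in for the claimed database (assumption: a simple in-memory list).
database = []

def provide_context(device_id, event_id, classifier_block, tracer_block):
    """Each device provides context data for a published data event."""
    database.append({
        "device": device_id,
        "event": event_id,
        "data_classifier": classifier_block,  # hypothetical field name
        "lineage_tracer": tracer_block,       # hypothetical field name
    })

def gather_sensitive(records):
    """Data classifier: extract the classifier block, collect PII and sensitive elements."""
    pii, sensitive = set(), set()
    for rec in records:
        block = rec["data_classifier"]
        pii.update(block.get("pii", []))
        sensitive.update(block.get("sensitive", []))
    return pii, sensitive

def to_triples(records):
    """Lineage processor: convert each lineage tracer block into a linked triple."""
    blocks = (rec["lineage_tracer"] for rec in records)
    return [(b["source"], b["relation"], b["target"]) for b in blocks]

def tokenize(triple):
    """Assumed tokenization: trim whitespace and lowercase each element."""
    return tuple(part.strip().lower() for part in triple)

def dedupe(triples):
    """Drop repeated triples while keeping first-seen order."""
    return list(dict.fromkeys(triples))

def build_graph(triples):
    """Data trace builder: adjacency mapping from source to (relation, target) edges."""
    graph = defaultdict(list)
    for source, relation, target in triples:
        graph[source].append((relation, target))
    return dict(graph)

# Three hypothetical devices report context for the same streamed event.
provide_context("pos-1", "evt-42",
                {"pii": ["email"], "sensitive": ["card_number"]},
                {"source": "Orders-Topic", "relation": "feeds", "target": "billing-db"})
provide_context("pos-2", "evt-42",
                {"pii": ["email", "phone"], "sensitive": []},
                {"source": "orders-topic ", "relation": "feeds", "target": "Billing-DB"})
provide_context("etl-1", "evt-42",
                {"pii": [], "sensitive": []},
                {"source": "billing-db", "relation": "feeds", "target": "warehouse"})

pii, sensitive = gather_sensitive(database)
triples = dedupe([tokenize(t) for t in to_triples(database)])
graph = build_graph(triples)
```

In this sketch the first two devices describe the same edge with different casing and spacing, so tokenizing before deduplicating collapses them into a single triple before the graph is built.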