US 11,720,549 B1
Data stream integrity using blockchain
Nathaniel G. McKervey, Tallahassee, FL (US); Paul Jean André Bernier, Vancouver (CA); Siegfried Puchbauer, San Francisco, CA (US); and Min Zhang, Vancouver (CA)
Assigned to SPLUNK INC., San Francisco, CA (US)
Filed by Splunk Inc., San Francisco, CA (US)
Filed on Apr. 30, 2021, as Appl. No. 17/246,411.
Int. Cl. G06F 16/23 (2019.01); H04L 9/32 (2006.01); H04L 9/00 (2022.01)
CPC G06F 16/2379 (2019.01) [G06F 16/2365 (2019.01); H04L 9/3239 (2013.01); H04L 9/50 (2022.05)] 30 Claims
OG exemplary drawing
 
1. A method comprising:
receiving, by one or more servers of a network platform, a plurality of machine data items from one or more networked sources;
hashing each of the plurality of machine data items to generate a hashed machine data set;
generating a batch hash by hashing the hashed machine data set to generate the batch hash, wherein a change to a machine data item in the plurality of machine data items will result in a change to the batch hash;
storing the batch hash on a blockchain;
generating a respective plurality of grouped machine data items by adding the batch hash to each of the plurality of machine data items, wherein a grouped machine data item comprises raw data associated with a respective grouped machine data item and the batch hash;
storing the plurality of grouped machine data items in a database;
receiving a request to validate that one or more of the grouped machine data items have not been modified; and
in response to the request, validating a given grouped machine data item by:
using a batch hash associated with the given grouped machine data item to retrieve the plurality of grouped machine data items from the database;
generating a new batch hash from the plurality of grouped machine data items retrieved; and
determining that the new batch hash matches the batch hash stored on the blockchain.