| CPC G06F 3/0608 (2013.01) [G06F 3/0652 (2013.01); G06F 3/067 (2013.01)] | 20 Claims |

|
1. A method, comprising:
receiving, by a first worker node, an indication of block identifiers of blocks of distributed storage within which a second worker node has stored data;
comparing block identifiers, of blocks within a bin of the distributed storage managed by the first worker node, with the indication of block identifiers to identify a subset of the block identifiers of blocks within the bin to process, wherein a size is set for the subset of the block identifiers based upon a target false positive rate being set for the distributed storage; and
processing the blocks within the bin that correspond to the subset of the block identifiers.
|
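The comparison step of claim 1 can be illustrated with a short sketch. The claim does not name a data structure for the "indication of block identifiers"; the sketch below assumes, purely for illustration, that it is a Bloom filter whose bit count and hash count are derived from the target false positive rate using the standard sizing formulas. All names (`bloom_parameters`, `BlockIdFilter`, `select_block_ids`) and the string form of the block identifiers are hypothetical, and whether matching or non-matching identifiers are the ones processed is a design choice the claim leaves open; here the matching identifiers are selected.

```python
import hashlib
import math


def bloom_parameters(expected_items, target_fp_rate):
    """Standard Bloom filter sizing: bits m and hash count k for n items at rate p."""
    m = math.ceil(-expected_items * math.log(target_fp_rate) / (math.log(2) ** 2))
    k = max(1, round(m / expected_items * math.log(2)))
    return m, k


class BlockIdFilter:
    """Hypothetical 'indication of block identifiers' built by the second worker node."""

    def __init__(self, expected_items, target_fp_rate):
        self.num_bits, self.num_hashes = bloom_parameters(expected_items, target_fp_rate)
        self.bits = bytearray((self.num_bits + 7) // 8)

    def _positions(self, block_id):
        # Double hashing: derive k bit positions from two 64-bit halves of one digest.
        digest = hashlib.sha256(block_id.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force odd so the step is never zero
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, block_id):
        for pos in self._positions(block_id):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, block_id):
        return all(self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(block_id))


def select_block_ids(bin_block_ids, indication):
    """First worker node: keep the bin's block identifiers that match the indication."""
    return [block_id for block_id in bin_block_ids if block_id in indication]


if __name__ == "__main__":
    # Second worker node records the blocks in which it has stored data.
    indication = BlockIdFilter(expected_items=100, target_fp_rate=0.01)
    for i in range(100):
        indication.add(f"blk-{i}")

    # First worker node compares the identifiers in its bin against the indication.
    bin_ids = [f"blk-{i}" for i in range(50, 150)]
    subset = select_block_ids(bin_ids, indication)
    print(len(subset), "of", len(bin_ids), "block identifiers selected")
```

Because a Bloom filter never yields a false negative, every identifier the second worker node actually recorded is selected; a lower target false positive rate enlarges the filter and reduces the number of identifiers selected in error.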