CPC G06F 12/0253 (2013.01) [G06F 16/2246 (2019.01); G06F 16/2272 (2019.01)] | 34 Claims |
1. A method for combining garbage collection and data integrity checking on a distributed key-value (KV) store utilized by a cluster of a distributed storage management system, the method comprising:
during a metadata collection phase of the garbage collection, concurrently identifying within the distributed KV store (i) unused block identifiers (IDs), corresponding to data blocks that represent garbage to be collected, that are no longer in use by the cluster but that are present in the distributed KV store and (ii) data integrity errors in a form of missing block IDs that are in use by the cluster but that are missing from the distributed KV store;
marking the unused block IDs for deletion from the distributed KV store;
adding the missing block IDs to a list of block IDs for which remediation of the respective data integrity errors is to be subsequently performed;
perform the garbage collection at a first predetermined or configurable interval with truncated block IDs; and
perform the garbage collection at a second predetermined or configurable interval with full block IDs.
|