| CPC G06F 3/067 (2013.01) [G06F 3/0604 (2013.01); G06F 3/0641 (2013.01); G06F 11/1469 (2013.01); G06F 16/00 (2019.01); G06F 11/1456 (2013.01); G06F 2201/80 (2013.01)] | 20 Claims |

|
1. A method comprising:
creating a partition group index that performs an indexing of a set of data items into a plurality of partition groups;
deduplicating, concurrently with the indexing, the set of data items stored in a node of a plurality of nodes into a set of deduplicated data items comprising singular copies of each data item of the set of data items, wherein a partition group of the plurality of partition groups corresponds to a node of a first plurality of nodes and comprises a subset of data items of the set of deduplicated data items stored in the node;
formatting the set of deduplicated data items into a recovery file in accordance with the partition group index; and
loading the set of deduplicated data items included in the recovery file onto a second plurality of nodes that are configured to restore the set of deduplicated data items in accordance with the partition group index.
|