US 12,282,676 B2
Recovery of clustered storage systems
Siamak Nazari, Mountain View, CA (US); David Dejong, Fremont, CA (US); Srinivasa Murthy, Cupertino, CA (US); Shayan Askarian Namaghi, San Jose, CA (US); and Roopesh Tamma, San Ramon, CA (US)
Assigned to Nvidia Corporation, Santa Clara, CA (US)
Filed by NVIDIA CORPORATION, Santa Clara, CA (US)
Filed on Mar. 2, 2023, as Appl. No. 18/116,740.
Application 18/116,740 is a continuation in part of application No. 18/115,211, filed on Feb. 28, 2023.
Claims priority of provisional application 63/314,987, filed on Feb. 28, 2022.
Claims priority of provisional application 63/314,996, filed on Feb. 28, 2022.
Claims priority of provisional application 63/314,970, filed on Feb. 28, 2022.
Claims priority of provisional application 63/316,081, filed on Mar. 3, 2022.
Prior Publication US 2023/0273742 A1, Aug. 31, 2023
Int. Cl. G06F 3/06 (2006.01)
CPC G06F 3/065 (2013.01) [G06F 3/0623 (2013.01); G06F 3/067 (2013.01); G06F 3/0683 (2013.01)] 18 Claims
OG exemplary drawing
 
1. A recovery process for a storage system comprising:
creating, for one or more recovery points, a plurality of snapshot sets respectively in a plurality of storage nodes of the storage system, each snapshot set of the snapshot sets corresponding to a recovery point of the one or more recovery points and containing snapshots of every volume that a storage node of the plurality of storage nodes owns or maintains;
assigning, for the one or more recovery points, a generation number to each snapshot in a corresponding snapshot set associated with a respective recovery point;
receiving, from a user of the storage system, a selection of a recovery point from the one or more recovery points;
suspending one or more storage services of the storage system;
in response to receiving the selection of the recovery point, in each storage node of the plurality of storage nodes, promoting the snapshots in the snapshot set corresponding to the recovery point selected by altering metadata associated with each storage node to point to respective data with a generation number in a range between a generation number at creation of the volumes and the generation number assigned to the selected recovery point;
rebooting the storage nodes; and
resuming the one or more storage services of the storage system.