US 11,941,278 B2
Data storage system with metadata check-pointing
Norbert Paul Kusters, Seattle, WA (US); Jianhua Fan, Issaquah, WA (US); Shuvabrata Ganguly, Kirkland, WA (US); Danny Wei, Seattle, WA (US); and Avram Israel Blaszka, Seattle, WA (US)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Nov. 5, 2021, as Appl. No. 17/520,537.
Application 17/520,537 is a continuation of application No. 16/457,008, filed on Jun. 28, 2019, granted, now Pat. No. 11,169,723.
Prior Publication US 2022/0057951 A1, Feb. 24, 2022
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 3/06 (2006.01); G06F 11/16 (2006.01); G06F 11/30 (2006.01)
CPC G06F 3/0644 (2013.01) [G06F 3/0617 (2013.01); G06F 3/0631 (2013.01); G06F 3/065 (2013.01); G06F 3/067 (2013.01); G06F 11/1612 (2013.01); G06F 11/3034 (2013.01)]
20 Claims
 
1. A data storage system, comprising:
a plurality of head nodes;
a plurality of mass storage devices,
wherein for a volume partition stored in the data storage system, a first and second head node of the plurality of head nodes are configured to:
store data for a replica of the volume partition in a log-structured storage of the respective first or second head node, wherein the log-structured storage comprises a volume data portion and a metadata portion; and
wherein the first head node is configured to store, to one or more of the plurality of mass storage devices, a copy of the metadata portion of the volume partition;
a failure detection agent configured to:
detect a failed one of the plurality of head nodes based on a failure of the failed head node to respond to a ping from the failure detection agent; and
indicate to a plurality of remaining ones of the plurality of head nodes that the failed head node has failed,
wherein the plurality of remaining ones of the plurality of head nodes are each configured to:
identify volume partitions for which replicas are stored on the failed head node; and
initiate, for the identified volume partitions, the designation of a replacement replica for the identified volume partitions on respective ones of the remaining head nodes.
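
The flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (HeadNode, FailureDetectionAgent, the placement map, handle_failure) is hypothetical. It assumes an in-memory log, a shared placement map from each volume partition to the two head nodes holding its replicas, and a simple rule under which the head node holding the surviving replica picks the replacement; the claim itself does not specify how a replacement is chosen.

```python
from dataclasses import dataclass, field


@dataclass
class HeadNode:
    name: str
    alive: bool = True
    # Hypothetical log-structured storage: per partition, an append-only
    # volume data portion and a metadata portion (offset -> log index).
    log: dict = field(default_factory=dict)

    def write(self, partition, offset, data):
        store = self.log.setdefault(partition, {"data": [], "metadata": {}})
        store["data"].append(data)  # append only, never overwrite
        store["metadata"][offset] = len(store["data"]) - 1

    def checkpoint_metadata(self, partition, mass_storage):
        # Store a copy of the metadata portion to mass storage (a dict here).
        mass_storage[partition] = dict(self.log[partition]["metadata"])

    def ping(self):
        return self.alive

    def handle_failure(self, failed_name, placement, nodes):
        # Identify partitions with a replica on the failed node, then
        # initiate designation of a replacement replica.
        # Assumption: only the surviving replica holder acts, so that
        # remaining nodes do not designate conflicting replacements.
        for holders in placement.values():
            if failed_name in holders and self.name in holders:
                candidates = sorted(
                    n.name for n in nodes if n.alive and n.name not in holders
                )
                if candidates:
                    holders[holders.index(failed_name)] = candidates[0]


class FailureDetectionAgent:
    def __init__(self, nodes, placement):
        self.nodes = nodes
        self.placement = placement

    def check(self):
        # A head node that does not answer the ping is treated as failed.
        failed = [n for n in self.nodes if not n.ping()]
        remaining = [n for n in self.nodes if n.ping()]
        for dead in failed:
            for node in remaining:  # indicate the failure to each remaining node
                node.handle_failure(dead.name, self.placement, self.nodes)
        return [n.name for n in failed]


nodes = [HeadNode("hn1"), HeadNode("hn2"), HeadNode("hn3")]
placement = {"vol1-p0": ["hn1", "hn2"]}
mass_storage = {}
for replica in nodes[:2]:
    replica.write("vol1-p0", 0, b"abc")
nodes[0].checkpoint_metadata("vol1-p0", mass_storage)

nodes[1].alive = False  # simulate a head node failure
failed = FailureDetectionAgent(nodes, placement).check()
print(failed, placement)
```

In this sketch the replacement is only designated in the placement map. Populating the new replica with volume data is outside what the sketch covers.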