US 12,242,465 B2
Systems and methods for zero downtime topology updates for distributed data storage
Hemal Shah, Sunnyvale, CA (US); Shashi Madappa, Campbell, CA (US); and Karan Vohra, Sammamish, WA (US)
Assigned to STRIPE, INC., South San Francisco, CA (US)
Filed by Stripe, Inc., South San Francisco, CA (US)
Filed on Jun. 26, 2023, as Appl. No. 18/214,205.
Prior Publication US 2024/0427758 A1, Dec. 26, 2024
Int. Cl. G06F 15/16 (2006.01); G06F 16/21 (2019.01); G06F 16/23 (2019.01); G06F 16/27 (2019.01)
CPC G06F 16/2379 (2019.01) [G06F 16/211 (2019.01); G06F 16/27 (2019.01)] 17 Claims
OG exemplary drawing
 
1. A method for updating data router nodes in a distributed storage system, comprising:
querying, by a first database agent of a database node, a cache data store for health status metrics of the cache data store;
storing, by the first database agent of the database node, the health status metrics in a data repository;
obtaining, by a second database agent of a router node, the health status metrics from the data repository;
updating, by the second database agent of the router node, a database topology file based on the health status metrics;
instructing, by the second database agent of the router node, a data routing service executed by the router node to pause processing of new data access requests until an updated database topology file is loaded to the data routing service, each new data access request being a data access request received after initiation of the updating of the database topology file;
processing, by the second database agent of the router node, existing data access requests with an existing copy of the database topology file, each existing data access request being received before initiation of the updating of the database topology file;
loading, by the data routing service executed by the router node, the database topology file updated by the second database agent into the routing service; and
processing, by the routing service in response to completion of the loading of the database topology file updated by the second database agent, the new data access requests for the router node.