CPC G06F 16/283 (2019.01) [G06F 9/5072 (2013.01); G06F 16/2455 (2019.01); H04L 41/0896 (2013.01); H04L 41/5025 (2013.01); H04L 67/1008 (2013.01); H04L 67/1097 (2013.01); H04L 43/0817 (2013.01)] | 26 Claims |
1. A method for implementing a fault-tolerant data warehouse using availability zones, comprising:
allocating a plurality of processing units to a data warehouse, the plurality of processing units comprising at least two processing units, the two processing units located in different availability zones, an availability zone comprising one or more data centers, each data center comprising redundant power, networking, and connectivity;
routing, by a processor, a query to a processing unit within the data warehouse, the query having a common session identifier with a query previously provided to the processing unit, the processing unit determined to be caching a data segment usable by the query, wherein:
the data warehouse accesses data within a database associated with a cloud storage resource;
the cloud storage resource is independent of the plurality of processing units; and
each of the plurality of processing units comprises a processor and a cache memory in which data associated with the database is cached;
as a result of monitoring a query workload metric, wherein the query workload metric is a number of queries running at an input degree of parallelism, determining that a processing capacity of the plurality of processing units has reached a threshold; and
changing a total number of processing units associated with the data warehouse using the query workload metric.
|
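The routing and scaling steps recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation. All names here (`ProcessingUnit`, `Warehouse`, `route`, `workload_metric`, `rescale`, `max_queries_per_unit`) are hypothetical. The sketch assumes the threshold is a fixed number of queries per processing unit, and it only adds units, although the claim covers any change in the total number.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class ProcessingUnit:
    """Hypothetical processing unit: a processor plus a local cache."""
    unit_id: str
    zone: str
    cached_segments: set[str] = field(default_factory=set)
    sessions: set[str] = field(default_factory=set)
    running_dops: list[int] = field(default_factory=list)  # one entry per running query


class Warehouse:
    """Hypothetical data warehouse whose units span availability zones."""

    def __init__(self, zones: list[str], max_queries_per_unit: int = 8) -> None:
        if len(set(zones)) < 2:
            raise ValueError("need at least two distinct availability zones")
        self.zones = list(dict.fromkeys(zones))  # de-duplicate, keep order
        self.max_queries_per_unit = max_queries_per_unit  # assumed threshold basis
        self.units: list[ProcessingUnit] = []
        # Allocate at least two units located in different availability zones.
        for zone in self.zones[:2]:
            self.add_unit(zone)

    def add_unit(self, zone: str | None = None) -> ProcessingUnit:
        if zone is None:
            # Place the new unit in the zone that currently holds the fewest units.
            zone = min(self.zones, key=lambda z: sum(u.zone == z for u in self.units))
        unit = ProcessingUnit(f"pu-{len(self.units)}", zone)
        self.units.append(unit)
        return unit

    def route(self, session_id: str, segment: str, dop: int) -> ProcessingUnit:
        """Prefer a unit that served this session and caches the segment."""
        def score(u: ProcessingUnit) -> tuple[bool, bool, int]:
            has_segment = segment in u.cached_segments
            same_session = session_id in u.sessions and has_segment
            return (same_session, has_segment, -len(u.running_dops))

        unit = max(self.units, key=score)
        unit.sessions.add(session_id)
        unit.cached_segments.add(segment)  # segment is cached after the first read
        unit.running_dops.append(dop)
        return unit

    def workload_metric(self, dop: int) -> int:
        """Number of queries currently running at the given degree of parallelism."""
        return sum(d == dop for u in self.units for d in u.running_dops)

    def rescale(self, dop: int) -> int:
        """Add a unit when the metric reaches the assumed capacity threshold."""
        threshold = self.max_queries_per_unit * len(self.units)
        if self.workload_metric(dop) >= threshold:
            self.add_unit()
        return len(self.units)
```

In this sketch, two queries that share a session identifier and touch the same data segment land on the same unit, and `rescale` changes the unit count only after the number of queries running at the given degree of parallelism reaches the threshold.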