CPC G06F 9/5027 (2013.01) [G06F 9/5077 (2013.01); G06F 9/5083 (2013.01); G06F 9/542 (2013.01); H04L 67/1001 (2022.05); G06F 2209/5011 (2013.01)] | 27 Claims |
1. A method comprising:
receiving query information from a first set of plurality of devices arranged in a first foreground cluster and a second set of plurality of devices arranged in a second foreground cluster associated with a data system, the query information including sources of queries requested by the first set of plurality of devices, the first foreground cluster being limited to handling only a first service type and the second foreground cluster being limited to handling only a second service type;
generating a first demand for computing resources for the data system based on the query information;
receiving a first distribution based on the first demand;
determining that a first source of queries has reached a cap limit of number of computing resources to be allocated for processing queries from the first source;
transmitting a first instruction to the first set of plurality of devices to release a first computing resource from the first distribution allocated to processing queries of the first service type from the first source based on determining that the first source has reached the cap limit;
transmitting a second instruction to the second set of plurality devices to release a second computing resource from the first distribution allocated to processing queries of the second service type from the first resource based on determining that the first source has reached the cap limit;
receiving a notification from the first of computing resources device regarding release of the first and second computing resources; and
allocating the released first and second computing resources to process queries from a different source.
|