US 12,106,149 B2
Flexible computing
Thierry Cruanes, San Mateo, CA (US); Igor Demura, Mountain View, CA (US); Varun Ganesh, San Bruno, CA (US); Prasanna Rajaperumal, Bangalore (IN); Libo Wang, Foster City, CA (US); and Jiaqi Yan, Menlo Park, CA (US)
Assigned to Snowflake Inc., Bozeman, MT (US)
Filed by Snowflake Inc., Bozeman, MT (US)
Filed on Apr. 27, 2023, as Appl. No. 18/140,086.
Application 18/140,086 is a continuation of application No. 18/050,608, filed on Oct. 28, 2022, granted, now 11,687,373.
Application 18/050,608 is a continuation of application No. 17/342,713, filed on Jun. 9, 2021, granted, now 11,513,859.
Application 17/342,713 is a continuation of application No. 17/084,861, filed on Oct. 30, 2020, granted, now 11,055,142.
Application 17/084,861 is a continuation of application No. 16/874,388, filed on May 14, 2020, granted, now 10,860,381.
Prior Publication US 2024/0078135 A1, Mar. 7, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 9/46 (2006.01); G06F 9/50 (2006.01); G06F 9/54 (2006.01); H04L 67/1001 (2022.01)
CPC G06F 9/5027 (2013.01) [G06F 9/5077 (2013.01); G06F 9/5083 (2013.01); G06F 9/542 (2013.01); H04L 67/1001 (2022.05); G06F 2209/5011 (2013.01)] 27 Claims
OG exemplary drawing
 
1. A method comprising:
receiving query information from a first set of plurality of devices arranged in a first foreground cluster and a second set of plurality of devices arranged in a second foreground cluster associated with a data system, the query information including sources of queries requested by the first set of plurality of devices, the first foreground cluster being limited to handling only a first service type and the second foreground cluster being limited to handling only a second service type;
generating a first demand for computing resources for the data system based on the query information;
receiving a first distribution based on the first demand;
determining that a first source of queries has reached a cap limit of number of computing resources to be allocated for processing queries from the first source;
transmitting a first instruction to the first set of plurality of devices to release a first computing resource from the first distribution allocated to processing queries of the first service type from the first source based on determining that the first source has reached the cap limit;
transmitting a second instruction to the second set of plurality devices to release a second computing resource from the first distribution allocated to processing queries of the second service type from the first resource based on determining that the first source has reached the cap limit;
receiving a notification from the first of computing resources device regarding release of the first and second computing resources; and
allocating the released first and second computing resources to process queries from a different source.