US 11,842,215 B2
Autoscaling and throttling in an elastic cloud service
Johan Harjono, San Francisco, CA (US); Daniel Geoffrey Karp, San Carlos, CA (US); Kunal Prafulla Nabar, Burlingame, CA (US); Rares Radut, Kitchener (CA); and Arthur Kelvin Shi, San Francisco, CA (US)
Assigned to Snowflake Inc., Bozeman, MT (US)
Filed by Snowflake Inc., Bozeman, MT (US)
Filed on Jan. 28, 2023, as Appl. No. 18/161,044.
Application 18/161,044 is a continuation of application No. 17/463,366, filed on Aug. 31, 2021, granted, now 11,599,389.
Claims priority of provisional application 63/202,769, filed on Jun. 23, 2021.
Prior Publication US 2023/0176909 A1, Jun. 8, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 9/50 (2006.01); G06F 9/355 (2018.01)
CPC G06F 9/5005 (2013.01) [G06F 9/3555 (2013.01); G06F 9/505 (2013.01); G06F 9/5077 (2013.01); G06F 9/5083 (2013.01)] 24 Claims
OG exemplary drawing
 
1. A method comprising:
generating an estimate of an allotment of new inbound requests to be received by a computing resource, the estimate defined to reduce resource usage of the computing resource below a maximum threshold;
setting a gateway limit of new inbound requests based on the estimate to throttle new inbound requests received by the computing resource;
monitoring resource usage at the computing resource in response to the throttling;
determining that the resource usage is below a minimum threshold;
determining that a rejection rate of new inbound requests is above a rejection threshold; and
expanding the allotment of new inbound requests to be received by the computing resource based on determining that the resource usage is below a minimum threshold and that the rejection rate of new inbound requests is above the rejection threshold.