CPC G06F 9/5027 (2013.01) [G06F 8/65 (2013.01); G06F 9/45558 (2013.01); G06F 9/4881 (2013.01); G06F 9/5061 (2013.01); G06F 9/5077 (2013.01); G06F 2009/45562 (2013.01); G06F 2209/5011 (2013.01); G06F 2209/5022 (2013.01)]
18 Claims
1. A computer-implemented system for scaling down an instance group of a computing platform, the system comprising:
one or more processors; and
a memory storing instructions, the instructions, when executed by the one or more processors, causing the system to perform:
determining whether a utilization percentage of resources allocated to an instance associated with the instance group is smaller than a predetermined scale-down threshold, the resources allocated to the instance including one or more running pods, the instance group including a predetermined desired number of instances; and
in response to determining that the utilization percentage of the instance is smaller than the predetermined scale-down threshold:
waiting for each running pod of the one or more running pods associated with the instance to run to completion;
preventing an additional pod being scheduled on the instance;
in response to each running pod of the one or more running pods associated with the instance running to completion, detaching the instance from the instance group; and
in response to detaching the instance from the instance group, decreasing by one the predetermined desired instance number associated with the instance group.
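The steps recited in claim 1 can be illustrated with a minimal in-memory sketch. It is not the patented implementation, only one possible reading of the claim under stated assumptions: the names `Pod`, `Instance`, `InstanceGroup`, and `scale_down` are hypothetical, pod completion is simulated by counting down `remaining_ticks`, and the sketch blocks new scheduling before it starts waiting, which is one possible ordering since the claim lists the two steps without fixing their order.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Pod:
    name: str
    remaining_ticks: int  # hypothetical stand-in for work left before completion

    @property
    def running(self) -> bool:
        return self.remaining_ticks > 0


@dataclass
class Instance:
    name: str
    capacity: float  # resources allocated to the instance (assumed > 0)
    used: float      # resources currently in use
    pods: List[Pod] = field(default_factory=list)
    schedulable: bool = True

    def utilization_pct(self) -> float:
        return 100.0 * self.used / self.capacity


@dataclass
class InstanceGroup:
    desired: int  # the predetermined desired number of instances
    instances: List[Instance] = field(default_factory=list)


def scale_down(group: InstanceGroup, instance: Instance, threshold_pct: float) -> bool:
    """Return True if the instance was drained and detached, False otherwise."""
    # Only act when utilization is below the scale-down threshold.
    if instance.utilization_pct() >= threshold_pct:
        return False

    # Prevent additional pods from being scheduled on the instance.
    instance.schedulable = False

    # Wait for every running pod to run to completion (simulated by ticks).
    while any(p.running for p in instance.pods):
        for p in instance.pods:
            if p.running:
                p.remaining_ticks -= 1

    # Detach the instance, then decrease the desired instance number by one.
    group.instances.remove(instance)
    group.desired -= 1
    return True


# Usage: one lightly used instance and one busy instance, threshold of 20%.
idle = Instance("idle", capacity=10.0, used=1.0, pods=[Pod("job", 3)])
busy = Instance("busy", capacity=10.0, used=9.0)
group = InstanceGroup(desired=2, instances=[idle, busy])

scale_down(group, idle, threshold_pct=20.0)  # drains and detaches "idle"
scale_down(group, busy, threshold_pct=20.0)  # no-op: utilization is 90%
print(group.desired)  # 1
```

Decrementing the desired count only after detachment mirrors the final step of the claim: if the count were left unchanged, a group controller that reconciles toward the desired number would presumably launch a replacement instance.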