US 12,217,091 B2
	Systems and methods for autoscaling instance groups of computing platforms
Mahmoud Abdelsalam, Kirkland, WA (US); Ryan McNamara, Seattle, WA (US); Ashray Jain, London (GB); Greg DeArment, Seattle, WA (US); Jason Ma, Seattle, WA (US); Guodong Xu, Issaquah, WA (US); and Vivek Lakshmanan, Seattle, WA (US)
Assigned to Palantir Technologies Inc., Denver, CO (US)
Filed by Palantir Technologies Inc., Denver, CO (US)
Filed on Apr. 22, 2022, as Appl. No. 17/727,380.
Application 17/727,380 is a division of application No. 16/398,435, filed on Apr. 30, 2019, granted, now 11,340,947.
Application 16/398,435 is a division of application No. 16/255,715, filed on Jan. 23, 2019, granted, now 10,324,763, issued on Jun. 18, 2019.
Claims priority of provisional application 62/778,138, filed on Dec. 11, 2018.
Prior Publication US 2022/0244997 A1, Aug. 4, 2022
Int. Cl. G06F 9/46 (2006.01); G06F 8/65 (2018.01); G06F 9/455 (2018.01); G06F 9/48 (2006.01); G06F 9/50 (2006.01)

CPC G06F 9/5027 (2013.01) [G06F 8/65 (2013.01); G06F 9/45558 (2013.01); G06F 9/4881 (2013.01); G06F 9/5061 (2013.01); G06F 9/5077 (2013.01); G06F 2009/45562 (2013.01); G06F 2209/5011 (2013.01); G06F 2209/5022 (2013.01)]

18 Claims

1. A computer-implemented system for scaling down an instance group of a computing platform, the system comprising:

one or more processors; and

a memory storing instructions, the instructions, when executed by the one or more processors, causing the system to perform:

determining whether a utilization percentage of resources allocated to an instance associated with the instance group is smaller than a predetermined scale-down threshold, the resources allocated to the instance including one or more running pods, the instance group including a predetermined desired number of instances; and

in response to determining that the utilization percentage of the instance is smaller than the predetermined scale-down threshold:

waiting for each running pod of the one or more running pods associated with the instance to run to completion;

preventing an additional pod being scheduled on the instance;

in response to each running pod of the one or more running pods associated with the instance running to completion, detaching the instance from the instance group; and

in response to detaching the instance from the instance group, decreasing by one the predetermined desired instance number associated with the instance group.