US 12,411,713 B2
Method for optimizing resource allocation based on prediction with reinforcement learning
Wen-Shyen Chen, Taichung (TW); Ming-Jye Sheu, Saratoga, CA (US); and Henry H. Tzeng, San Jose, CA (US)
Assigned to ProphetStor Data Services, Inc., Taichung (TW)
Filed by ProphetStor Data Services, Inc., Taichung (TW)
Filed on Feb. 23, 2022, as Appl. No. 17/678,103.
Prior Publication US 2023/0267006 A1, Aug. 24, 2023
Int. Cl. G06F 9/50 (2006.01)

CPC G06F 9/5027 (2013.01)

8 Claims

1. A method for optimizing resource allocation in a computer cluster based on prediction with reinforcement learning, implemented by a processor, comprising the steps of:

a) providing a prediction on the number of units of a hardware resource needed for a workload in more than N timepoints after a 0-th timepoint to the processor, wherein there are a maximum of M units of the resource available for provisioning and U_i is the number of units needed at the i-th timepoint according to the prediction, and N, M and i are positive integers;

b) calculating at least one 0-th possible operation cost (POC₀) based on at least one possible provisioned number (PPN) at a 1-th timepoint (PPN₁) ranging from U₁ to M by the processor, wherein the POC₀ is given by

POC₀=K+RF×|PPN₁−K|+PPN₁,

where RF is a rebalance factor between 0 and 1, and K is a real number;

c) for each i-th timepoint with i from 1 to N:

c1) calculating at least one i-th possible operation cost (POC_i), wherein the POC_i is given by

POC_i=POC_(i−1)+RF×|PPN_(i+1)−PPN_i|+PPN_(i+1),

where POC_(i−1) is the possible operation cost(s) calculated for the (i−1)-th timepoint, PPN_(i+1) is the PPN at the (i+1)-th timepoint ranging from U_(i+1) to M, PPN_i is the PPN at the i-th timepoint ranging from U_i to M, and the PPN_i used for calculating POC_i and the PPN_i used for calculating POC_(i−1) have the same value;

c2) identifying the smallest and the second smallest POC_i to efficiently prune search space and reduce computational complexity; and

c3) if the smallest and the second smallest POC_i are calculated from the same PPN_i, then setting the PPN_i used to calculate the smallest POC_i as an i-th assigned number, and removing the POC_i(s) not calculated from the i-th assigned number for the calculation of the next timepoint, thereby further reducing computational burden; and

d) provisioning an i-th assigned number of units of the hardware resource at the i-th timepoint for the workload by the processor to dynamically adjust resource allocation within the computer cluster.
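
Steps b) through c3) can be illustrated with a minimal Python sketch. It is one possible reading of the claim, not the patented implementation. The function name plan_provisioning and its parameters predicted, max_units, rf and k are hypothetical names for U_1..U_(N+1), M, RF and K. Two behaviors are assumptions the claim does not specify: candidate costs are kept as an explicit list of (cost, path) pairs, and any timepoint left unassigned after step c3) is filled in from the cheapest surviving path.

```python
import heapq


def plan_provisioning(predicted, max_units, rf, k):
    """Sketch of steps b) to c3); all names here are hypothetical.

    predicted[0] is U_1 and predicted[-1] is U_(N+1), so N = len(predicted) - 1.
    Returns (assigned numbers for timepoints 1..N, cost of the cheapest path kept).
    """
    n = len(predicted) - 1
    if n < 1:
        raise ValueError("need predictions for at least two timepoints")
    if any(u < 0 or u > max_units for u in predicted):
        raise ValueError("each predicted demand must lie between 0 and max_units")

    # Step b): POC_0 = K + RF*|PPN_1 - K| + PPN_1 for every PPN_1 in [U_1, M].
    # Each candidate is stored as (cost, path), where path[j] is PPN_(j+1).
    paths = [
        (k + rf * abs(p - k) + p, (p,))
        for p in range(predicted[0], max_units + 1)
    ]

    assigned = {}
    for i in range(1, n + 1):
        # Step c1): POC_i = POC_(i-1) + RF*|PPN_(i+1) - PPN_i| + PPN_(i+1),
        # with PPN_(i+1) ranging over [U_(i+1), M].
        paths = [
            (cost + rf * abs(q - path[-1]) + q, path + (q,))
            for cost, path in paths
            for q in range(predicted[i], max_units + 1)
        ]

        # Step c2): find the smallest and the second smallest POC_i.
        two_best = heapq.nsmallest(2, paths, key=lambda cp: cp[0])
        best, second = two_best[0], two_best[-1]  # identical if only one path

        # Step c3): if both were calculated from the same PPN_i, fix it as the
        # i-th assigned number and drop every POC_i built on another PPN_i.
        if best[1][i - 1] == second[1][i - 1]:
            assigned[i] = best[1][i - 1]
            paths = [cp for cp in paths if cp[1][i - 1] == assigned[i]]

    # Assumption (not in the claim): timepoints that never met the c3) test
    # take their value from the cheapest surviving path.
    best_cost, best_path = min(paths, key=lambda cp: cp[0])
    for i in range(1, n + 1):
        assigned.setdefault(i, best_path[i - 1])
    return [assigned[i] for i in range(1, n + 1)], best_cost
```

For example, with predicted = [2, 3, 2], max_units = 4, rf = 0.5 and k = 2, the two cheapest costs at i = 1 come from different PPN_1 values (7.5 from PPN_1 = 2 and 8.5 from PPN_1 = 3), so nothing is pruned. At i = 2 the two cheapest costs (10.0 and 10.5) both come from PPN_2 = 3, so 3 becomes the 2nd assigned number and the call returns ([2, 3], 10.0). When the c3) test fails at a timepoint, this sketch keeps every path, so the list can grow quickly for large M or N.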