US 12,204,755 B2
Elastic request handling technique for optimizing workload performance
Daniel Chilton, Raleigh, NC (US); Gaurav Gangalwar, Pune (IN); Manoj Premanand Naik, San Jose, CA (US); Pradeep Thomas, Santa Clara, CA (US); and Will Strickland, Wake Forest, NC (US)
Assigned to Nutanix, Inc., San Jose, CA (US)
Filed by Nutanix, Inc., San Jose, CA (US)
Filed on Jun. 29, 2022, as Appl. No. 17/853,123.
Claims priority of application No. 202241025987 (IN), filed on May 4, 2022.
Prior Publication US 2023/0359359 A1, Nov. 9, 2023
Int. Cl. G06F 3/06 (2006.01)
CPC G06F 3/0611 (2013.01) [G06F 3/0655 (2013.01); G06F 3/0679 (2013.01)] 41 Claims
OG exemplary drawing
 
1. A method comprising:
receiving input/output (I/O) requests of a workload at a file server of a node coupled to persistent storage media, wherein different types of I/O requests have corresponding latency bounds;
processing the I/O requests of the workload according to an expected I/O latency using a pool of threads executing on one or more processors of the node;
maintaining a number of the threads as active threads of the pool to run on the processors;
measuring an I/O latency for storing data of the I/O requests on the persistent storage media;
and
in response to the measured I/O latency for storing the data exceeding a predetermined threshold, increasing the number of active threads from the pool deployed to increase processing of the I/O requests to run to completion on the active threads based on the corresponding expected I/O latency bounds for the I/O request type.