US 12,086,848 B1
Cost-aware tiering for data storage
Umesh Deshpande, San Jose, CA (US); Paul Henri Muench, San Jose, CA (US); Robert Michael Rees, Los Gatos, CA (US); and Thanh T. Pham, San Jose, CA (US)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by INTERNATIONAL BUSINESS MACHINES CORPORATION, Armonk, NY (US)
Filed on Mar. 8, 2023, as Appl. No. 18/180,459.
Int. Cl. G06F 12/00 (2006.01); G06F 3/06 (2006.01); G06F 12/02 (2006.01); G06Q 30/0283 (2023.01); G06Q 30/04 (2012.01)
CPC G06Q 30/0283 (2013.01) [G06F 3/0649 (2013.01); G06F 3/067 (2013.01); G06Q 30/04 (2013.01); G06F 3/0604 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A computer-based method of cost-aware tiering for data storage comprising:
detecting data to be stored in one or more storage systems and gathering access and usage information for each data segment in the detected data;
generating and maintaining a first data structure, the first data structure including periods of activity below a defined threshold for the detected data;
generating and maintaining a common data structure to identify and track application-wide data patterns;
gathering static pricing information and context pricing information for the one or more storage systems;
calculating a tiering threshold corresponding to a continuous period of activity below a second defined threshold required for which local storage costs exceed a tiering cost;
identifying inactive data segments within the detected data and calculating, for each of the identified inactive data segments, probabilities of the inactive data segments remaining inactive for a duration that exceeds the calculated tiering threshold;
calculating a risk score for each of the identified inactive data segments based on the calculated probabilities and a file size of each of the identified inactive data segments;
generating a tiering list based on the calculated risk scores; and
automatically storing the detected data based on the generated tier list.