US 12,314,220 B2
Data selection for multi-phase file-based data migration
Huijuan Fan, Chengdu (CN); Junhua Shao, Chengdu (CN); and Shuang Zheng, Chengdu (CN)
Assigned to Dell Products L.P., Round Rock, TX (US)
Filed by Dell Products L.P., Round Rock, TX (US)
Filed on Feb. 6, 2023, as Appl. No. 18/106,005.
Claims priority of application No. 202211502668.7 (CN), filed on Nov. 28, 2022.
Prior Publication US 2024/0176761 A1, May 30, 2024
Int. Cl. G06F 16/11 (2019.01); G06F 16/16 (2019.01); G06F 16/178 (2019.01); G06F 16/182 (2019.01)
CPC G06F 16/119 (2019.01) [G06F 16/11 (2019.01); G06F 16/16 (2019.01); G06F 16/178 (2019.01); G06F 16/1827 (2019.01)]
20 Claims
 
1. An apparatus comprising:
at least one processing device comprising a processor coupled to a memory;
the at least one processing device being configured to perform steps of:
determining, for a given time period corresponding to an expected duration of a multi-phase file-based data migration from a source storage system to a destination storage system, predicted write access count metrics for a plurality of files to be migrated as part of the multi-phase file-based data migration;
selecting, based at least in part on the predicted write access count metrics, a first subset of the plurality of files for migration during a first phase of the multi-phase file-based data migration and a second subset of the plurality of files for migration during a second phase of the multi-phase file-based data migration;
migrating, from the source storage system to the destination storage system, the selected first subset of the plurality of files during the first phase of the multi-phase file-based data migration; and
migrating, from the source storage system to the destination storage system, the selected second subset of the plurality of files during the second phase of the multi-phase file-based data migration;
wherein the first phase of the multi-phase file-based data migration comprises a baseline copy phase and the second phase of the multi-phase file-based data migration comprises an incremental copy phase;
wherein the first subset of the plurality of files comprises files having predicted write access count metrics below a first threshold value; and
wherein the second subset of the plurality of files comprises files having predicted write access count metrics above a second threshold value.
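
The selection step recited in claim 1 can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation: the names `predict_write_count`, `select_phases`, and `MigrationPlan` are hypothetical, the predictor (a historical write rate multiplied by the expected migration duration) is a placeholder for whatever prediction model is actually used, and the handling of files whose predicted count falls between the two thresholds is not specified by the claim, so the sketch simply leaves them unassigned.

```python
from dataclasses import dataclass, field


@dataclass
class MigrationPlan:
    """Files grouped by the migration phase they are selected for."""
    baseline: list[str] = field(default_factory=list)     # first phase (baseline copy)
    incremental: list[str] = field(default_factory=list)  # second phase (incremental copy)
    unassigned: list[str] = field(default_factory=list)   # between thresholds (assumption)


def predict_write_count(writes_per_hour: float, expected_duration_hours: float) -> float:
    """Placeholder predictor (assumption): extrapolate a historical write rate
    over the expected duration of the migration."""
    return writes_per_hour * expected_duration_hours


def select_phases(predicted: dict[str, float],
                  first_threshold: float,
                  second_threshold: float) -> MigrationPlan:
    """Split files by predicted write access count.

    Below first_threshold  -> baseline copy phase.
    Above second_threshold -> incremental copy phase.
    Anything else is left unassigned, since the claim does not say
    where such files go.
    """
    if first_threshold > second_threshold:
        raise ValueError("first_threshold must not exceed second_threshold")
    plan = MigrationPlan()
    for path, count in sorted(predicted.items()):
        if count < first_threshold:
            plan.baseline.append(path)
        elif count > second_threshold:
            plan.incremental.append(path)
        else:
            plan.unassigned.append(path)
    return plan


if __name__ == "__main__":
    # Hypothetical write rates (writes per hour) for three files.
    rates = {"/archive/2019.tar": 0.0, "/db/index.dat": 1.0, "/logs/app.log": 12.0}
    # Assume the migration is expected to take 10 hours.
    predicted = {p: predict_write_count(r, 10.0) for p, r in rates.items()}
    print(select_phases(predicted, first_threshold=5.0, second_threshold=50.0))
```

In this sketch, rarely written files are copied in the baseline phase, where they are unlikely to change before cutover, while frequently written files are deferred to the incremental phase so that they are not copied and then copied again.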