| CPC G06F 9/5038 (2013.01) [G06F 9/5072 (2013.01); G06F 2209/501 (2013.01); G06F 2209/5019 (2013.01)] | 20 Claims |

|
1. A computer-implemented method of managing content during data migration, the method comprising:
receiving, at a prediction machine learning (ML) model, a first dataset including real-time system parameters for a legacy application, the first dataset including tabular data;
determining, at the prediction ML model and based on sampled process times, an estimated time for execution of the data migration of the first dataset;
determining, based on the estimated time and a given checkpoint duration, an estimated number of segments into which the first dataset can be segmented, where a segment refers to a discrete segment of the first dataset expected to migrate during the given checkpoint duration;
determining, based on the number of segments, an estimated segment size for each segment;
determining a degree of data skew for each column;
selecting a first column associated with the smallest data skew as a segmentation column;
automatically generating a code that describes the estimated number of segments, the estimated segment size, and the selected first column; and
executing the data migration of the first dataset based on the generated code.
|
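The arithmetic implied by the segmenting and column-selection steps of claim 1 can be illustrated with a minimal Python sketch. It is not the claimed implementation. The claim does not define a skew metric, so the sketch assumes one: the count of the most frequent value divided by the mean count per distinct value, where 1.0 means perfectly uniform. The function names, the JSON descriptor standing in for the "generated code", and the sample table are all hypothetical. The estimated migration time is taken as an input, because the prediction ML model itself is not sketched here.

```python
import json
import math
from collections import Counter


def estimate_segments(estimated_time_s, checkpoint_duration_s):
    """Number of segments such that each is expected to fit in one checkpoint."""
    if checkpoint_duration_s <= 0:
        raise ValueError("checkpoint duration must be positive")
    return max(1, math.ceil(estimated_time_s / checkpoint_duration_s))


def estimate_segment_size(total_rows, num_segments):
    """Rows per segment, rounded up so that all rows are covered."""
    return math.ceil(total_rows / num_segments)


def column_skew(values):
    """Assumed skew metric: max value count / mean count per distinct value."""
    if not values:
        raise ValueError("column is empty")
    counts = Counter(values)
    mean_count = len(values) / len(counts)
    return max(counts.values()) / mean_count


def select_segmentation_column(table):
    """Pick the column name with the smallest skew (first one wins on ties)."""
    return min(table, key=lambda name: column_skew(table[name]))


def build_plan(table, estimated_time_s, checkpoint_duration_s):
    """Hypothetical stand-in for the generated code: a JSON descriptor."""
    total_rows = len(next(iter(table.values())))
    num_segments = estimate_segments(estimated_time_s, checkpoint_duration_s)
    return json.dumps({
        "num_segments": num_segments,
        "segment_size": estimate_segment_size(total_rows, num_segments),
        "segmentation_column": select_segmentation_column(table),
    })


if __name__ == "__main__":
    # Illustrative tabular data, stored as column name -> list of values.
    sample = {
        "region": ["EU", "EU", "EU", "EU", "US", "EU"],
        "order_id": [1, 2, 3, 4, 5, 6],
        "status": ["open", "open", "open", "closed", "open", "closed"],
    }
    # Assumed inputs: 3600 s estimated migration time, 900 s checkpoints.
    print(build_plan(sample, 3600, 900))
```

Rounding up in both steps is a design choice made for this sketch: it keeps each segment within the checkpoint duration and ensures that no rows are left unassigned, at the cost of a possibly smaller final segment.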