CPC G06F 11/1464 (2013.01) [G06F 9/5083 (2013.01); G06F 11/1451 (2013.01); G06F 2201/84 (2013.01)] | 20 Claims |
1. A data back-up system configured to incrementally back-up at a plurality of back-ups a plurality of data sets from one or more devices to a data back-up server by using a proxy pool, comprising a plurality of proxies, the data back-up system being configured to:
receive, for each of the data sets, an indication of one or more types of a plurality of different files therein, a number of the files, and a total size of each of the files;
estimate a weighted average load of each of the data sets based on the number, the total sizes, a compressibility factor for each of the one or more types, and an encryption factor for each of the one or more types, wherein each of the estimates indicates a time for performing the back-up of the data set, and wherein at least one of the data sets comprises one or more of the files that is different from at least one of the files corresponding to another one of the data sets;
identify, for each of a plurality of workload, a statemap comprising metadata that provides information about which file is backed up at what time, wherein the metadata allows for a determination of whether the file from the same data set is newly added, removed, or modified;
identify, for each of the workloads, a first proxy of the plurality of proxies, wherein the first proxy is available for a next one of the back-ups incrementally of at least one of the one or more data sets, and wherein the first proxy precludes a download or transfer of the identified statemap otherwise performed during a next one back-up; and
distribute one or more of the data sets as a plurality of workloads across the proxies, each of the distributions being based on the weighted average load of the respective data set, wherein each of the workloads is uploaded to the back-up server by the first proxy.
|