| CPC G06F 16/1844 (2019.01) [G06F 16/119 (2019.01); G06F 16/162 (2019.01); G06F 16/178 (2019.01)] | 20 Claims |

|
1. A computer program product for managing files in a fileset at a primary storage site and a secondary storage site over a network, wherein the computer program product comprises a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause operations, the operations comprising:
determining whether files in a fileset satisfy a management policy;
for a first group of files in the fileset determined to satisfy the management policy, encoding the first group of files in a local storage at the primary storage site to migrate by streaming to a file container in an object storage over a local network, wherein the primary storage site includes the object storage, the local storage, and the local network;
migrating file metadata for the migrated files to the object storage;
updating the file metadata for the migrated files to indicate access information in the file container;
deleting file data for the files in the first group of migrated files from the fileset in the local storage;
encoding a second group of files in the fileset that did not satisfy the management policy to pre-migrate by streamlining the second group of files from the fileset to the file container while leaving a copy of the files in the second group in the fileset in the local storage; and
for each specified file of files in the fileset, perform:
determining whether the specified file in the fileset is a new file, based on a time the specified file was created and a current time, that has not been migrated to the file container within a replication time;
encoding the specified file in the fileset, that is determined to be a new file that has not been migrated within the replication time, to migrate by streaming of
the file container and replicate to the secondary storage site; and
copying the specified file in the fileset, that is determined not to be a new file that has not been migrated within the replication time, to the secondary storage site.
|