| CPC G06F 16/1752 (2019.01) [G06F 11/1453 (2013.01); G06F 2201/84 (2013.01)] | 20 Claims |

|
1. A method comprising:
identifying, by a data platform implemented by a computing system, a first chunkfile that includes a chunk that matches data for an object of a file system;
computing, by the data platform, properties of one or more of a plurality of chunks included in the first chunkfile, the properties being one or more of:
a number of the chunks, included in the first chunkfile, having chunk expiration times in each of two or more expiration time bands each having a range of expiration times,
respective numbers of references for the chunks included in the first chunkfile, or
respective last referenced times for the chunks included in the first chunkfile;
determining, by the data platform after identifying the first chunkfile, whether to deduplicate the data for the object of the file system by adding a reference to the matching chunk, wherein determining whether to deduplicate the data for the object of the file system comprises applying a policy to the properties of the one or more of the plurality of chunks included in the first chunkfile to determine whether a write once, read many (WORM) lock expiration time for the first chunkfile can be extended according to the policy; and
in response to determining the WORM lock expiration time for the first chunkfile can be extended according to the policy, deduplicating the data for the object of the file system by adding the reference to the matching chunk.
|
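The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name here (`Chunk`, `Chunkfile`, `band_counts`, `can_extend_worm_lock`, `try_deduplicate`), the half-open band ranges, and the threshold policy are hypothetical, chosen only to show one way a policy might be applied to per-chunkfile properties before a reference is added. Of the three properties the claim lists, the example policy uses only the per-band expiration counts.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Band = Tuple[int, int]  # assumed half-open range [start, end) of expiration times


@dataclass
class Chunk:
    expiration_time: int      # assumed unit: arbitrary integer timestamps
    ref_count: int = 0        # number of references to this chunk
    last_referenced: int = 0  # time of the most recent reference


@dataclass
class Chunkfile:
    worm_lock_expiration: int
    chunks: Dict[str, Chunk] = field(default_factory=dict)


def band_counts(cf: Chunkfile, bands: List[Band]) -> List[int]:
    """Count the chunks whose expiration time falls in each band."""
    return [
        sum(1 for c in cf.chunks.values() if start <= c.expiration_time < end)
        for start, end in bands
    ]


def can_extend_worm_lock(cf: Chunkfile, bands: List[Band],
                         min_late_fraction: float) -> bool:
    """Hypothetical policy: permit extension only when a large enough share of
    the banded chunks already expires in the latest band."""
    counts = band_counts(cf, bands)
    total = sum(counts)
    if total == 0:
        return False
    return counts[-1] / total >= min_late_fraction


def try_deduplicate(cf: Chunkfile, chunk_id: str, new_expiration: int, now: int,
                    bands: List[Band], min_late_fraction: float = 0.5) -> bool:
    """Add a reference to the matching chunk only if the policy allows the
    chunkfile's WORM lock expiration time to be extended."""
    chunk = cf.chunks.get(chunk_id)
    if chunk is None:
        return False  # no matching chunk in this chunkfile
    if not can_extend_worm_lock(cf, bands, min_late_fraction):
        return False  # caller would write the data elsewhere instead
    cf.worm_lock_expiration = max(cf.worm_lock_expiration, new_expiration)
    chunk.expiration_time = max(chunk.expiration_time, new_expiration)
    chunk.ref_count += 1
    chunk.last_referenced = now
    return True
```

With bands `[(0, 200), (200, 400)]` and the default threshold, a chunkfile whose chunks mostly expire in the later band accepts the new reference and has its lock extended, while one whose chunks mostly expire early rejects it and is left unchanged.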