US 12,265,503 B2
Adaptive deduplication of data chunks
Aiswarya Bhavani Shankar, San Jose, CA (US); Dane Van Dyck, Atlanta, GA (US); Venkata Ranga Radhanikanth Guturi, San Jose, CA (US); and Leo Prasath Arulraj, Santa Clara, CA (US)
Assigned to Cohesity, Inc., San Jose, CA (US)
Filed by Cohesity, Inc., San Jose, CA (US)
Filed on Mar. 14, 2023, as Appl. No. 18/183,659.
Prior Publication US 2024/0311342 A1, Sep. 19, 2024
Int. Cl. G06F 16/17 (2019.01); G06F 11/14 (2006.01); G06F 16/174 (2019.01)
CPC G06F 16/1752 (2019.01) [G06F 11/1453 (2013.01); G06F 2201/84 (2013.01)]
20 Claims
1. A method comprising:
identifying, by a data platform implemented by a computing system, a first chunkfile that includes a chunk that matches data for an object of a file system;
computing, by the data platform, properties of one or more of a plurality of chunks included in the first chunkfile, the properties being one or more of:
a number of the chunks, included in the first chunkfile, having chunk expiration times in each of two or more expiration time bands each having a range of expiration times,
respective numbers of references for the chunks included in the first chunkfile, or
respective last referenced times for the chunks included in the first chunkfile;
determining, by the data platform after identifying the first chunkfile, whether to deduplicate the data for the object of the file system by adding a reference to the matching chunk, wherein determining whether to deduplicate the data for the object of the file system comprises applying a policy to the properties of the one or more of the plurality of chunks included in the first chunkfile to determine whether a write once, read many (WORM) lock expiration time for the first chunkfile can be extended according to the policy; and
in response to determining the WORM lock expiration time for the first chunkfile can be extended according to the policy, deduplicating the data for the object of the file system by adding the reference to the matching chunk.
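The flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Chunk` and `ChunkFile` types, the function names, the band boundaries, and in particular the `min_fraction` policy are hypothetical, chosen only to show how expiration-band counts might gate a WORM lock extension before a reference is added.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

Band = Tuple[float, float]  # [start, end) range of expiration times


@dataclass
class Chunk:
    chunk_id: str
    expiration_time: float
    ref_count: int
    last_referenced: float


@dataclass
class ChunkFile:
    chunks: Dict[str, Chunk]
    worm_lock_expiration: float


def band_index(bands: List[Band], t: float) -> Optional[int]:
    """Return the index of the band containing t, or None if no band does."""
    for i, (lo, hi) in enumerate(bands):
        if lo <= t < hi:
            return i
    return None


def band_counts(chunkfile: ChunkFile, bands: List[Band]) -> List[int]:
    """Count the chunks whose expiration time falls in each band."""
    counts = [0] * len(bands)
    for chunk in chunkfile.chunks.values():
        i = band_index(bands, chunk.expiration_time)
        if i is not None:
            counts[i] += 1
    return counts


def can_extend_worm_lock(chunkfile: ChunkFile, new_expiration: float,
                         bands: List[Band], min_fraction: float = 0.5) -> bool:
    """Hypothetical policy (an assumption, not taken from the claim):
    permit an extension only if at least min_fraction of the chunks already
    expire in the band containing new_expiration or in a later band."""
    if new_expiration <= chunkfile.worm_lock_expiration:
        return True  # the existing lock already covers the new reference
    target = band_index(bands, new_expiration)
    if target is None or not chunkfile.chunks:
        return False  # be conservative when the policy cannot be evaluated
    long_lived = sum(band_counts(chunkfile, bands)[target:])
    return long_lived / len(chunkfile.chunks) >= min_fraction


def maybe_deduplicate(chunkfile: ChunkFile, chunk_id: str,
                      new_expiration: float, bands: List[Band],
                      now: float) -> bool:
    """Add a reference to the matching chunk only if the policy allows the
    chunkfile's WORM lock to cover new_expiration. Returns True on dedup."""
    chunk = chunkfile.chunks.get(chunk_id)
    if chunk is None:
        return False  # no matching chunk in this chunkfile
    if not can_extend_worm_lock(chunkfile, new_expiration, bands):
        return False  # caller would write the data to a new chunk instead
    chunk.ref_count += 1
    chunk.last_referenced = now
    chunk.expiration_time = max(chunk.expiration_time, new_expiration)
    chunkfile.worm_lock_expiration = max(chunkfile.worm_lock_expiration,
                                         new_expiration)
    return True
```

In this sketch, a chunkfile in which most chunks expire soon refuses a reference that would push its lock far into the future, so one long-lived chunk does not keep many expired chunks locked on disk. The claim also lists reference counts and last referenced times as properties a policy may use; a fuller policy could weigh those as well.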