US 12,436,919 B2
Sorted entropy chunks for higher space reduction
Sweetesh Singh, New Delhi (IN)
Assigned to Cohesity, Inc., Santa Clara, CA (US)
Filed by Cohesity, Inc., San Jose, CA (US)
Filed on Oct. 24, 2024, as Appl. No. 18/925,675.
Application 18/925,675 is a continuation of application No. 18/497,635, filed on Oct. 30, 2023, granted, now 12,197,389.
Prior Publication US 2025/0139058 A1, May 1, 2025
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/174 (2019.01)
CPC G06F 16/1748 (2019.01) [G06F 16/1744 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method comprising:
selecting, by processing circuitry of a data platform, a plurality of data chunks from a data store for creating a chunkfile;
in response to selecting the plurality of data chunks from the data store for creating the chunkfile, determining, by the processing circuitry of the data platform, an entropy value for each of the plurality of data chunks selected to obtain a corresponding plurality of entropy values by, at least in part, calculating, via the processing circuitry, the entropy value based on a determined frequency of each of a plurality of symbols representative of each of the plurality of data chunks selected;
reorganizing, by the processing circuitry and based on the corresponding plurality of entropy values, the plurality of data chunks to obtain a reorganized plurality of data chunks;
compressing, by the processing circuitry, the reorganized plurality of data chunks to obtain a compressed chunkfile; and
storing, by the processing circuitry, the compressed chunkfile superseding the plurality of data chunks.