| CPC G06F 16/1748 (2019.01) [G06F 16/1744 (2019.01)] | 20 Claims |

|
1. A method comprising:
selecting, by processing circuitry of a data platform, a plurality of data chunks from a data store for creating a chunkfile;
in response to selecting the plurality of data chunks from the data store for creating the chunkfile, determining, by the processing circuitry of the data platform, an entropy value for each of the plurality of data chunks selected to obtain a corresponding plurality of entropy values by, at least in part, calculating, via the processing circuitry, the entropy value based on a determined frequency of each of a plurality of symbols representative of each of the plurality of data chunks selected;
reorganizing, by the processing circuitry and based on the corresponding plurality of entropy values, the plurality of data chunks to obtain a reorganized plurality of data chunks;
compressing, by the processing circuitry, the reorganized plurality of data chunks to obtain a compressed chunkfile; and
storing, by the processing circuitry, the compressed chunkfile superseding the plurality of data chunks.
|