CPC G06F 3/0608 (2013.01) [G06F 3/067 (2013.01); G06F 3/0623 (2013.01); G06F 3/0659 (2013.01); H03M 7/6005 (2013.01); H03M 7/6011 (2013.01)] | 18 Claims |
1. A system for encoding data using mismatch probability estimation, comprising:
a computing device comprising a processor, a memory, and a non-volatile data storage device;
a statistical analyzer comprising a first plurality of programming instructions stored in the memory which, when operating on the processor, causes the computing device to:
receive a training data set for encoding, the training data set comprising sourceblocks of data;
determine a frequency of occurrence of each sourceblock of the training data set;
calculate a mismatch probability estimate comprising a probability that any given sourceblock in a non-training data set to be later received for encoding will not be a sourceblock that was contained in the training data set;
generate a mismatch sourceblock representing sourceblocks that were not contained in the training data set, and assign the mismatch probability estimate to the mismatch sourceblock as the frequency of occurrence of the mismatch sourceblock; and
a codebook generator comprising a second plurality of programming instructions stored in the memory which, when operating on the processor, causes the computing device to:
generate a codebook from the sourceblocks of the training data set and the mismatch sourceblock using an entropy encoding method wherein codewords are assigned to each sourceblock based on its frequency of occurrence.
|
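The steps recited in claim 1 can be illustrated with a minimal Python sketch. The claim does not say how the mismatch probability estimate is calculated or which entropy encoding method is used, so the choices here are assumptions made only for illustration: a smoothed Good-Turing-style estimate (share of sourceblocks seen exactly once) stands in for the estimate, and Huffman coding stands in for the entropy encoding method. The names `MISMATCH`, `mismatch_probability`, `huffman`, and `build_codebook` are hypothetical and do not come from the patent.

```python
import heapq
from collections import Counter
from itertools import count

# Hypothetical sentinel standing in for the "mismatch sourceblock".
MISMATCH = object()


def mismatch_probability(counts):
    """Estimate the chance that a future sourceblock was not seen in training.

    Assumption: a smoothed Good-Turing-style estimate, i.e. the number of
    sourceblocks seen exactly once divided by the total count. The smoothing
    keeps the result strictly between 0 and 1.
    """
    total = sum(counts.values())
    if total == 0:
        raise ValueError("training data set is empty")
    singletons = sum(1 for c in counts.values() if c == 1)
    return max(singletons, 1) / (total + 1)


def huffman(weights):
    """Build a prefix-free codebook (symbol -> bit string) from weights."""
    tie = count()  # tie-breaker so the heap never compares dicts
    heap = [(w, next(tie), {sym: ""}) for sym, w in weights.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        w0, _, c0 = heapq.heappop(heap)
        w1, _, c1 = heapq.heappop(heap)
        merged = {s: "0" + code for s, code in c0.items()}
        merged.update({s: "1" + code for s, code in c1.items()})
        heapq.heappush(heap, (w0 + w1, next(tie), merged))
    return heap[0][2]


def build_codebook(training_blocks):
    """Statistical analyzer plus codebook generator, in one function."""
    counts = Counter(training_blocks)           # frequency of each sourceblock
    p_miss = mismatch_probability(counts)       # mismatch probability estimate
    total = sum(counts.values())
    # Scale observed frequencies so that all weights, including the
    # mismatch sourceblock's, sum to 1.
    weights = {b: (1 - p_miss) * c / total for b, c in counts.items()}
    weights[MISMATCH] = p_miss                  # mismatch sourceblock
    return huffman(weights)


training = [b"aa", b"aa", b"aa", b"bb", b"bb", b"cc"]
codebook = build_codebook(training)
```

In this sketch, a sourceblock absent from the training data set would be looked up as `codebook.get(block, codebook[MISMATCH])`, so the mismatch codeword marks a block that the codebook cannot represent directly. How such a block is then handled is outside claim 1 and is not modeled here.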