CPC G06F 16/215 (2019.01) [G06F 16/2255 (2019.01); G06F 16/2425 (2019.01)] | 20 Claims |
1. A method, comprising:
moving a window from a first position in a data buffer to a second position in the data buffer, and the data buffer includes one or more words;
calculating a hash value of data in the window when the window is in the second position;
checking a byte that has entered the window, as a result of a movement of the window from the first position to the second position, to determine whether the byte is whitespace; and
when the hash value is a greatest hash value seen up to a current position of the window, and when the byte is determined to be whitespace, setting a candidate offset to a whitespace offset, and the candidate offset denotes a possible segment boundary that does not fall within any word in the data buffer.
|