US 12,147,401 B2
Elective deduplication
Alexei Kabishcer, Marlborough, MA (US); Uri Shabi, Tel Mond (IL); and Bar Harel, Tel Aviv (IL)
Assigned to Dell Products L.P., Round Rock, TX (US)
Filed by Dell Products L.P., Round Rock, TX (US)
Filed on Jan. 26, 2022, as Appl. No. 17/585,240.
Prior Publication US 2023/0237030 A1, Jul. 27, 2023
Int. Cl. G06F 16/00 (2019.01); G06F 16/215 (2019.01); G06F 16/22 (2019.01); G06F 16/2458 (2019.01); G06F 16/906 (2019.01); G06F 17/18 (2006.01); G06F 18/23 (2023.01)
CPC G06F 16/215 (2019.01) [G06F 16/2255 (2019.01); G06F 16/2462 (2019.01); G06F 16/906 (2019.01); G06F 17/18 (2013.01); G06F 18/23 (2023.01)] 18 Claims
OG exemplary drawing
 
1. A method for electing deduplication in a storage system, the method comprising:
calculating a similarity hash signature for a data unit of a storage system;
searching a digest table of the storage system for a similarity hash signature within a predetermined distance of the similarity hash signature for the data unit;
using the search to determine whether to add a similarity hash signature or a strong hash signature of the data unit to the digest table; and
deduplicating the data unit from a storage device of the storage system based on the determination of whether the similarity hash signature or the strong hash signature is added to the digest table.