CPC G06F 16/137 (2019.01) [G06F 16/152 (2019.01); G06F 16/168 (2019.01); G06F 16/183 (2019.01); G06F 16/285 (2019.01); G06F 16/93 (2019.01); H04L 67/01 (2022.05); H04L 67/06 (2013.01); H04L 67/10 (2013.01); H04L 67/1097 (2013.01)] | 18 Claims |
1. A method for detecting similar files, the method comprising:
receiving, at a processor, a request from a user device to upload a file to a server;
extracting file information comprising at least a filename, a file size, and metadata from the file with an upload client, wherein the metadata includes information regarding creation of the file separate from content of the file;
generating, by the server, a file signature for the file based on at least the filename, the file size, and the metadata, wherein the file signature is different from a hash signature;
accessing one or more existing file signatures for each of one or more existing files stored on the server;
comparing the one or more existing file signatures to the file signature;
accessing a first hash signature for the existing file corresponding to the existing file signature;
upon determining that the file signature is within a predetermined deviation from one of the existing file signatures based on the comparison, generating a second hash signature for the file corresponding to the file signature; and
storing the file to the server responsive to determining that the first hash signature does not equal the second hash signature.
|
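The steps of claim 1 can be sketched in code as follows. This is only an illustrative reading under stated assumptions, not the patented implementation: the claim does not specify how the file signature is built or how deviation is measured, so the `FileSignature` fields, the `deviation` scoring, the `max_deviation` threshold of 0.2, the use of SHA-256, and the `Server` class are all hypothetical. The sketch also assumes that a file whose signature matches no existing signature is simply stored, which the claim does not state.

```python
import hashlib
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class FileSignature:
    # Hypothetical layout: built from filename, file size, and creation
    # metadata, and deliberately not a hash of the file content.
    filename: str
    size: int
    created_by: str


def deviation(a: FileSignature, b: FileSignature) -> float:
    """Assumed distance in [0, 1]; 0 means the signatures are identical."""
    name_diff = 1.0 - SequenceMatcher(
        None, a.filename.lower(), b.filename.lower()
    ).ratio()
    size_diff = abs(a.size - b.size) / max(a.size, b.size, 1)
    meta_diff = 0.0 if a.created_by == b.created_by else 1.0
    return (name_diff + size_diff + meta_diff) / 3.0


def content_hash(data: bytes) -> str:
    # SHA-256 stands in for the claim's unspecified "hash signature".
    return hashlib.sha256(data).hexdigest()


class Server:
    def __init__(self, max_deviation: float = 0.2):
        self.max_deviation = max_deviation  # the "predetermined deviation"
        self.records = []  # list of (signature, content hash, data)

    def upload(self, filename: str, data: bytes, created_by: str) -> bool:
        """Return True if the file was stored, False if it was a duplicate."""
        signature = FileSignature(filename, len(data), created_by)

        # Compare the new signature against every existing signature.
        similar = [
            record for record in self.records
            if deviation(record[0], signature) <= self.max_deviation
        ]

        new_hash = None
        if similar:
            # Hash the upload only when a similar signature exists.
            new_hash = content_hash(data)
            if any(existing_hash == new_hash for _, existing_hash, _ in similar):
                return False  # same content already stored

        # Assumption: files with no similar signature are stored as well.
        if new_hash is None:
            new_hash = content_hash(data)
        self.records.append((signature, new_hash, data))
        return True


server = Server()
server.upload("report.docx", b"hello", "alice")  # stored
server.upload("report.docx", b"hello", "alice")  # rejected as a duplicate
server.upload("report.docx", b"hellp", "alice")  # similar signature, new content
```

The inexpensive signature comparison acts as a filter, so the more costly content hash is computed for comparison only when an existing file looks similar. In this sketch the hash is still computed once at storage time so that later uploads have a first hash signature to compare against.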