US 12,393,406 B2
Entity search engine powered by copy-detection
Gokcan Ozakdag, London (GB); and Mihir Pandya, San Jose, CA (US)
Assigned to Palantir Technologies Inc., Denver, CO (US)
Filed by Palantir Technologies Inc., Denver, CO (US)
Filed on Sep. 25, 2023, as Appl. No. 18/473,515.
Application 18/473,515 is a continuation of application No. 17/651,220, filed on Feb. 15, 2022, granted, now 11,803,357.
Claims priority of provisional application 63/149,955, filed on Feb. 16, 2021.
Prior Publication US 2024/0012627 A1, Jan. 11, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 8/36 (2018.01); G06F 8/70 (2018.01)
CPC G06F 8/36 (2013.01) [G06F 8/70 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A computer system comprising:
one or more non-transitory computer readable storage devices configured to store a plurality of computer readable instructions; and
one or more processors configured to execute the plurality of computer readable instructions to cause the computer system to perform operations comprising:
receiving a query code snippet identifying code to be matched;
determining one or more fingerprints of the query code snippet;
searching, using at least the one or more fingerprints of the query code snippet, a database of fingerprints of a plurality of portions of a set of software programs to determine a set of matching portions of the set of software programs;
removing, from the set of matching portions, any matching portions that match a fingerprint of blacklisted code to generate an updated set of matching portions;
coalescing the updated set of matching portions into a software match list; and
ranking the software on the software match list to determine a ranked software match list, the ranking indicative of how well the fingerprints of the software on the software match list matches fingerprints of the query code snippet.