US 12,008,309 B1
Document marking techniques using semantically similar phrases for document source detection
Sanjay Krishnan, Hinsdale, IL (US); David Wong, Los Gatos, CA (US); Chad Voss, Seattle, WA (US); Troy Batterberry, Kirkland, WA (US); Clayton Huthwaite, Chesterfield, VA (US); Colin Saunders, San Francisco, CA (US); Stephen Bianamara, Seattle, WA (US); and Parker Beck, Kenmore, WA (US)
Assigned to EchoMark, Inc., Kirkland, WA (US)
Filed by EchoMark, Inc., Kirkland, WA (US)
Filed on Apr. 4, 2023, as Appl. No. 18/295,710.
Int. Cl. G06F 17/00 (2019.01); G06F 3/0482 (2013.01); G06F 3/0484 (2022.01); G06F 40/166 (2020.01); G06F 40/30 (2020.01); G06N 3/0475 (2023.01)
CPC G06F 40/166 (2020.01) [G06F 3/0482 (2013.01); G06F 3/0484 (2013.01); G06F 40/30 (2020.01); G06N 3/0475 (2023.01)] 20 Claims
OG exemplary drawing
 
1. A system comprising:
at least one processor; and
one or more computer storage media storing computer readable instructions thereon that, when executed by the at least one processor, cause the at least one processor to perform operations comprising:
accessing a document comprising terms;
determining a first set of alternative terms for a first term in the document and a second set of alternative terms for a second term in the document;
selecting one or more alternative terms from each of the first set of alternative terms and the second set of alternative terms based on a tone of the first term and a tone of the second term relative to a tone of the one or more alternative terms;
generating a plurality of unique copies of the document, each unique copy being distinct from other unique copies of the document and generated by replacing the first term and the second term in unique combinations with the selected one or more alternative terms;
individually distributing the plurality of unique copies to a plurality of recipients; and
indexing an association between the plurality of unique copies and the plurality of recipients, such that the index identifies a recipient for an identified unique copy.