US 11,928,154 B2
System and method for efficient creation and incremental updating of representations of email conversations
Levi Jonathan Bucao, Austin, TX (US); Peter Anthony Lee, Pflugerville, TX (US); and Ajay Guyyala, Cedar Park, TX (US)
Assigned to CS DISCO, INC., Austin, TX (US)
Filed by CS DISCO, INC., Austin, TX (US)
Filed on Feb. 26, 2021, as Appl. No. 17/187,116.
Prior Publication US 2022/0277158 A1, Sep. 1, 2022
Int. Cl. G06F 16/93 (2019.01); G06F 16/901 (2019.01)
CPC G06F 16/9024 (2019.01) [G06F 16/93 (2019.01)] 21 Claims
OG exemplary drawing
 
1. A system for document analysis, comprising:
a processor;
a data store, having an email graph; and
a non-transitory computer readable medium instructions for incrementally updating the email graph, by:
obtaining email data;
determining an incremental graph comprising a set of nodes representing emails and edges representing relationships between the emails, the determination of the incremental graph comprising:
determining a first node for the incremental graph, wherein the first node represents a first email determined from the obtained email data;
adding the first node to the incremental graph;
determining one or more edges for the incremental graph, wherein the one or more edges include at least one edge determined by:
determining a candidate edge for the incremental graph based on a comparison of first text of the first email to second text of a second email represented by a second node in the email graph or the incremental graph; and
adding the candidate edge to the incremental graph as one of the one or more edges;
starting with the one or more edges in the email graph, traversing the email graph to determine a related set of nodes and a related set of edges of the email graph to add to the incremental graph, wherein the related set of nodes and a related set of edges of the email graph are directly or indirectly related to the one or more edges; and
adding the related set of nodes and related set of edges determined from the email graph to the incremental graph;
traversing the incremental graph to identify a set of conversations in the incremental graph, each of the set of conversations comprising a subgraph of the incremental graph; and
updating the identified set of conversations in the email graph based on the incremental graph to incrementally update the email graph based on the obtained email data.