CPC G06F 16/9024 (2019.01) [G06F 16/93 (2019.01)] | 21 Claims |
1. A system for document analysis, comprising:
a processor;
a data store, having an email graph; and
a non-transitory computer readable medium instructions for incrementally updating the email graph, by:
obtaining email data;
determining an incremental graph comprising a set of nodes representing emails and edges representing relationships between the emails, the determination of the incremental graph comprising:
determining a first node for the incremental graph, wherein the first node represents a first email determined from the obtained email data;
adding the first node to the incremental graph;
determining one or more edges for the incremental graph, wherein the one or more edges include at least one edge determined by:
determining a candidate edge for the incremental graph based on a comparison of first text of the first email to second text of a second email represented by a second node in the email graph or the incremental graph; and
adding the candidate edge to the incremental graph as one of the one or more edges;
starting with the one or more edges in the email graph, traversing the email graph to determine a related set of nodes and a related set of edges of the email graph to add to the incremental graph, wherein the related set of nodes and a related set of edges of the email graph are directly or indirectly related to the one or more edges; and
adding the related set of nodes and related set of edges determined from the email graph to the incremental graph;
traversing the incremental graph to identify a set of conversations in the incremental graph, each of the set of conversations comprising a subgraph of the incremental graph; and
updating the identified set of conversations in the email graph based on the incremental graph to incrementally update the email graph based on the obtained email data.
|