| CPC G06F 16/9024 (2019.01) [G06F 11/328 (2013.01)] | 20 Claims |

|
1. A method comprising:
for a graph to be loaded into memory of one or more machines of a distributed graph processing system, generating a memory consumption estimate by:
sampling graph data of the graph from a source;
estimating graph statistics for the graph based on the sampled graph data;
predicting, based on the graph statistics, an estimated final graph size indicating an amount of memory used to keep a plurality of graph structures in memory,
wherein:
the graph comprises a plurality of entities and one or more edges between entities,
the plurality of graph structures represent the graph, and
the plurality of graph structures comprise at least one vertex data structure representing the plurality of entities of the graph and at least one edge data structure representing the one or more edges between entities; and
predicting, based on the graph statistics, an estimated peak memory usage indicating an upper bound of memory usage across the one or more machines during loading of the graph,
wherein the memory consumption estimate comprises the estimated final graph size and the estimated peak memory usage;
wherein the method is performed by one or more computing devices.
|