CPC G06F 16/278 (2019.01) | 17 Claims |
1. A system for analysis and management of data distribution in a distributed database environment, comprising:
one or more processors;
a first agent executing on the one of more processors to collect statements between an application and a database located in at least one data server in a plurality of data servers, wherein the statements collected include a command stream; and
a second agent exeuting on the one or more processors to perform operations comprising:
analyze results by analyzing the information including the collected statements, to produce analysis results;
producing analysis results, wherein producing analysis results includes:
identifying in the command stream of the collected statements patterns of command that appear together; and
identifying related tables in the database from the identified patterns of command; and
using the analysis results to produce a data distribution policy having one or more rules for defining logical data chunks of the database that include at least the related tables and for distributing and managing the logical data chunks for storage among the plurality of data servers in a horizontally partitioned database.
|