| CPC G06F 16/2471 (2019.01) [G06F 16/24542 (2019.01); G06F 16/24556 (2019.01); G06F 16/285 (2019.01)] | 20 Claims |

|
1. An apparatus comprising:
at least one processor coupled to at least one memory, the at least one processor, when executing program code, is configured to:
operate as a primary cluster in a clustering database system implemented over one or more processing platforms; and
manage a plurality of secondary clusters, wherein the primary cluster comprises one or more primary database nodes and a database cluster registry of the plurality of secondary clusters, wherein each of the secondary clusters comprises one or more primary database nodes and one or more secondary database nodes;
wherein the primary cluster is further configured to:
generate the database cluster registry, the generating comprising:
registering the plurality of secondary clusters to the clustering database system, the registration comprising replicating data corresponding to each of the plurality of secondary clusters at the database cluster registry as one or more datasets;
automatically synchronizing the one or more primary database nodes of the primary cluster with the one or more primary database nodes of each of the plurality of secondary clusters to obtain metadata from each of the plurality of secondary clusters, wherein the obtained metadata comprises: a database type and a model of each of the plurality of secondary clusters, schema definition data for each of the plurality of secondary clusters, a replication status of the data corresponding to the registered plurality of secondary clusters of the plurality of secondary clusters, an integration type of each of the plurality of secondary clusters, a supported language of each of the plurality of secondary clusters, communication patterns of each of the plurality of secondary clusters, on premises context details and cloud cluster context details;
maintaining the obtained metadata corresponding to each secondary cluster of the plurality of secondary clusters; and
automatically creating definition data corresponding to all available database types of the database cluster registry based on the obtained metadata;
add and remove one or more secondary clusters from the database cluster registry;
receive a query from a client;
authenticate the client, with the one or more primary database nodes of the primary cluster, using one or more client details;
initiate a session for the client based on a positive result of the authentication;
analyze the query to determine an intent of the query and a context of the query in the clustering database system in relation to the registered plurality of secondary clusters and, based on the intent and the context of the query, to identify at least one secondary cluster from the plurality of secondary clusters in the clustering database system to execute the query;
utilize at least a portion of the obtained metadata when analyzing the query;
wherein the data associated with the query is a data type that differs from a data type associated with a database type of the identified at least one secondary cluster, utilize at least a portion of the obtained metadata to dynamically replicate one or more schemas based on the schema definition data and to dynamically replicate the one or more datasets at the primary cluster from a first database type to a second database type, wherein the schema definition data corresponds to the identified at least one secondary cluster, and wherein the second database type is the database type of the identified at least one secondary cluster;
send one or more details of the session to the identified at least one secondary cluster; and
redirect the client to establish a connection between the client and the identified at least one secondary cluster;
wherein the one or more of the plurality of secondary clusters are configured to support a data model type that is different than a data model type of one or more others of the plurality of secondary clusters.
|