CPC G06F 21/6218 (2013.01) [G06F 16/213 (2019.01); G06F 16/2423 (2019.01); G06F 16/24534 (2019.01); G06F 16/24542 (2019.01); G06F 16/9024 (2019.01); G06F 21/6227 (2013.01); G06N 3/08 (2013.01); G06N 5/022 (2013.01); G06N 5/04 (2013.01)] | 14 Claims |
1. A method, comprising:
receiving a query at a dataset access platform, the query being formatted according to a first data schema, the query comprising data associated with a request to access a dataset;
generating a copy of the query;
identifying whether the query is a master or a replica as the copy of the query;
identifying a datastore for storing the query as either the master or the copy, or both;
updating a graph as a data model associated with the query to identify elements to distinguish the copy for data operations to be performed;
parsing the copy of the query in the first schema, the parsing being performed by an inference engine configured to identify the dataset, to infer an attribute associated with the query, and to generate one or more data links between the dataset and another dataset accessible by the dataset access platform, wherein parsing the copy of the query further includes parsing the query into a data structure including an abstract syntax tree associated with a target query language;
rewriting the copy of the query in a second schema including a triples-based format and, if the attribute indicates the query is configured to provide authentication data to access the dataset, the rewriting comprising converting the copy of the query into a triple and converting the attribute into another triple;
optimizing rewriting the copy of the query,
determining one or more property paths to the another dataset;
identifying a database engine to execute the query, the database engine is configured to be topologically internal to a data network associated with the dataset access platform; and
converting other data to a further triple, the other data and the further triple being associated with a path configured to route the query or the copy of the query in the second schema including the triples-based format from the dataset access platform to retrieve query results from a target database configured to store the dataset as graph-based data,
wherein the one or more property paths are determined by performing a comparison of another attribute associated with each of the one or more property paths to a threshold to identify an optimal path to run the query or the copy of the query.
|