| CPC G06F 16/24549 (2019.01) [G06F 16/213 (2019.01); G06F 16/24537 (2019.01); G06F 16/24542 (2019.01); G06F 16/93 (2019.01)] | 20 Claims |

|
1. A system, comprising:
at least one data processor; and
at least one memory storing instructions which, when executed by the at least one data processor, cause operations comprising:
generating, based at least on a schema configuration of one or more document stores, a plurality of data in a JavaScript Object Notation (JSON) format for storage at the one or more document stores;
generating, based at least on the schema configuration, a query configuration, and a feature matrix of the one or more document stores, one or more queries to match the plurality of data stored at the one or more document stores, wherein the query configuration includes a first object describing an element used in a project clause of a query, and wherein a function selected randomly from the feature matrix is applied to the projected element;
distributing the one or more queries for execution at the one or more document stores by a scalable quantity of concurrently operating worker nodes;
generating one or more performance metrics during the execution of the one or more queries at the one or more document stores; and
applying, based at least on the one or more performance metrics, one or more performance improvements during subsequent query processing at the one or more document stores, wherein the one or more performance improvements include avoiding unnecessary unnesting of JSON objects.
|
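The operations recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation. All names (`SCHEMA`, `FEATURE_MATRIX`, `QUERY_CONFIG`, `generate_documents`, `generate_query`, `distribute`), the SQL-like query text, and the use of threads as stand-ins for worker nodes are assumptions made for illustration. The claim does not specify a query language or a worker architecture. The final step, applying performance improvements such as avoiding unnecessary unnesting, is not modeled here.

```python
import json
import random
import time
from concurrent.futures import ThreadPoolExecutor

# Hypothetical schema configuration: field name -> type tag.
SCHEMA = {"id": "int", "name": "str", "tags": "list"}

# Hypothetical feature matrix: functions supported by each document store.
FEATURE_MATRIX = {"store_a": ["UPPER", "LOWER", "LENGTH"]}

# Hypothetical query configuration: the "project" object describes the
# element used in the project clause.
QUERY_CONFIG = {"project": {"element": "name"}, "collection": "docs"}


def generate_documents(schema, count, rng):
    """Generate `count` JSON-serialisable documents that follow `schema`."""
    makers = {
        "int": lambda: rng.randint(0, 1000),
        "str": lambda: "".join(rng.choice("abcdef") for _ in range(6)),
        "list": lambda: [rng.choice(["x", "y", "z"]) for _ in range(2)],
    }
    return [
        {field: makers[kind]() for field, kind in schema.items()}
        for _ in range(count)
    ]


def generate_query(query_config, feature_matrix, store, rng):
    """Build a query whose projected element is wrapped in a function
    selected randomly from the store's feature matrix entry."""
    func = rng.choice(feature_matrix[store])
    element = query_config["project"]["element"]
    return f"SELECT {func}({element}) FROM {query_config['collection']}"


def run_query(query, execute):
    """Execute one query and return (query, elapsed seconds) as a metric."""
    start = time.perf_counter()
    execute(query)
    return query, time.perf_counter() - start


def distribute(queries, execute, workers):
    """Run the queries on `workers` concurrent threads and collect metrics.
    Threads stand in for worker nodes; `workers` is the scalable quantity."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda q: run_query(q, execute), queries))


if __name__ == "__main__":
    rng = random.Random(0)
    docs = generate_documents(SCHEMA, 3, rng)
    print(json.dumps(docs[0]))  # one generated document in JSON format
    queries = [
        generate_query(QUERY_CONFIG, FEATURE_MATRIX, "store_a", rng)
        for _ in range(4)
    ]
    # `execute` is a no-op placeholder for a real document-store client call.
    for query, seconds in distribute(queries, lambda q: None, workers=2):
        print(f"{seconds:.6f}s  {query}")
```

In this sketch the `execute` callable is the only point of contact with a document store, so a real client call could be substituted without changing the generation or distribution logic. The per-query elapsed time is the simplest possible performance metric. A real system would likely record more, such as rows returned or plan details.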