| CPC G06F 16/24553 (2019.01) [G06F 16/24542 (2019.01); G06F 16/248 (2019.01)] | 20 Claims |

|
1. A method, comprising:
identifying, by one or more processors coupled to memory, a multi-dimensional sample dataset comprising a plurality of samples, each of the plurality of samples comprising a first tensor identified by a respective first identifier;
identifying, by the one or more processors, a query for the multi-dimensional sample dataset, the query specifying a sampling operation for the multi-dimensional sample dataset, the sampling operation of the query indicating an expression including the respective first identifier of the first tensor and a first weight for a probability distribution of query results to select from the plurality of samples of the multi-dimensional sample dataset;
parsing, by the one or more processors, the query to extract the sampling operation, the expression, and the first weight;
executing, by the one or more processors, the query based on the sampling operation to randomly select a subset of samples from the plurality of samples as a set of query results, the subset of samples selected to include a first number of samples that satisfy the expression for the first tensor, the first number of samples determined based on the first weight; and
providing, by the one or more processors, as output, the set of query results.
|