US 12,450,263 B1
Systems and methods for stratification of datasets to resolve one or more conforming values in a multi-tenant system
Weijie Qu, Beijing (CN); Nikki Wong, San Anselmo, CA (US); and Sai Ram Rakesh Gorenta, Lathrop, CA (US)
Assigned to Zuora, Inc., Redwood City, CA (US)
Filed by Zuora, Inc., Redwood City, CA (US)
Filed on Jul. 25, 2024, as Appl. No. 18/784,655.
Int. Cl. G06F 16/00 (2019.01); G06F 16/16 (2019.01); G06F 16/28 (2019.01)
CPC G06F 16/285 (2019.01) [G06F 16/168 (2019.01)] 12 Claims
OG exemplary drawing
 
1. A multi-tenant system, comprising:
one or more hardware processors; and
memory storing computer instructions, the computer instructions when executed by the one or more hardware processors configured to perform:
ingesting a dataset comprising transaction records, each of the transaction records comprising a field value and attribute values of different attributes;
testing candidate stratification mechanisms on the dataset, each candidate stratification mechanism stratifying the dataset according to a different attribute;
determining a test value from field values for each attribute value within each stratification of each candidate stratification mechanism;
evaluating each candidate stratification mechanism based on one or more metrics following stratification according to the candidate stratification mechanism on the dataset, the one or more metrics indicating a proportion of conforming field values that are within a threshold range of a corresponding test value;
selecting a particular stratification mechanism from the candidate stratification mechanisms based on the one or more metrics;
stratifying the dataset according to the particular stratification mechanism;
resolving one or more conforming values from the stratified dataset, the resolving comprising resolving a median value from the field values within each stratification of the selected particular stratification mechanism, the proportion of conforming field values being determined based on one or more constraints indicating a permitted range of deviations of each of the field values from the resolved median value within each stratification;
receiving an indication of an override of a resolved median value to a modified median value; and
recomputing the one or more metrics based on the modified median value.