| CPC G06Q 30/0635 (2013.01) [G01C 21/3407 (2013.01); G06Q 10/06311 (2013.01); G06Q 10/06375 (2013.01); G06Q 10/087 (2013.01)] | 17 Claims |

|
1. A method comprising:
obtaining marketplace state data associated with an online concierge system that facilitates processing of a request, from a customer via a customer application, for procurement of one or more items from one or more warehouses, assignment of the request to an available shopper via a shopper application, and generation of routing instructions for delivery of the one or more items by the available shopper to the customer;
applying a hyperparameter learning model to the marketplace state data to predict a set of hyperparameters affecting a set of respective parameterized control decision models, wherein the hyperparameter learning model is trained on historical marketplace state data and a configured outcome objective for the online concierge system, wherein training the hyperparameter learning model comprises:
logging the marketplace state data over a period of time; and
re-training the hyperparameter learning model based on the logged marketplace state data;
independently applying the set of parameterized control decision models to the marketplace state data using the hyperparameters to generate a respective set of control parameters affecting marketplace operation of the online concierge system, comprising:
for each of the set of parameterized control decision models,
modifying parameters of the parameterized control decision model using the hyperparameters,
providing the marketplace state data as input to the modified parameterized control decision, and
receiving one of the set of control parameters as output from the modified parameterized control decision; and
applying the respective set of control parameters to modify an operation of the online concierge system.
|