US 12,254,482 B2
	Systems and methods for contract based offer generation
Jacob Solotaroff, Palo Alto, CA (US); and Jamie Rapperport, Palo Alto, CA (US)
Assigned to Maplebear Inc., San Francisco, CA (US)
Filed by MAPLEBEAR INC., San Francisco, CA (US)
Filed on Jan. 25, 2023, as Appl. No. 18/159,249.
Application 18/159,249 is a division of application No. 17/573,620, filed on Jan. 11, 2022, abandoned.
Application 17/573,620 is a continuation in part of application No. 16/120,178, filed on Aug. 31, 2018, abandoned.
Application 16/120,178 is a continuation of application No. 15/990,005, filed on May 25, 2018, abandoned.
Application 15/990,005 is a continuation in part of application No. 14/209,851, filed on Mar. 13, 2014, granted, now 9,984,387, issued on May 29, 2018.
Application 17/573,620 is a continuation in part of application No. 16/157,018, filed on Oct. 10, 2018, granted, now 10,915,912, issued on Feb. 9, 2021.
Application 17/573,620 is a continuation in part of application No. 16/216,997, filed on Dec. 11, 2018, granted, now 11,270,325, issued on Mar. 8, 2022.
Claims priority of provisional application 63/143,847, filed on Jan. 30, 2021.
Claims priority of provisional application 61/780,630, filed on Mar. 13, 2013.
Claims priority of provisional application 62/576,742, filed on Oct. 25, 2017.
Claims priority of provisional application 62/553,133, filed on Sep. 1, 2017.
Prior Publication US 2023/0169530 A1, Jun. 1, 2023
Int. Cl. G06Q 30/02 (2023.01); G06Q 30/0201 (2023.01); G06Q 30/0211 (2023.01); G06Q 30/0251 (2023.01)

CPC G06Q 30/0206 (2013.01) [G06Q 30/0211 (2013.01); G06Q 30/0255 (2013.01); G06Q 30/0271 (2013.01)]

20 Claims

1. A computer implemented method comprising:

receiving, at an offer generation system, data describing an offer on a product;

applying a natural language processing (NLP) model to the received data to extract the product and the offer;

accessing a plurality of test offers stored by an offer bank;

accessing transaction logs related to the product;

scoring each test offer in the offer bank against the extracted product, wherein scoring a test offer of the plurality of test offers comprises:

predicting a forecast score for each test offer by:

applying a reinforcement learning model to the transaction logs related to the product, wherein the reinforcement learning model is trained using transaction logs associated with the plurality of test offers to predict a likelihood that a test offer will achieve an offer objective;

determining a difference between the offer and the test offer; and

updating the forecast score by applying a penalty to the forecast score based on the determined difference;

selecting a subset of the plurality of test offers based in part on the forecast score of each test offer, wherein assigning the subset of test offers comprises maximizing orthogonality of a set of variables associated with the product;

transmitting the subset of test offers to client devices of users;

receiving, from client devices of the users, responses to the subset of test offers, the responses comprising at least one of (1) whether or not a user viewed a test offer, (2) whether or not a user clicked through a test offer, (3) whether or not a user deleted a test offer, (4) whether or not the user saved the test offer to an electronic coupon folder, (5) whether or not a user forwarded the test offer to someone else, (6) whether or not a user posted a test offer to an online forum, or (7) whether or not the user purchased a product using the test offer;

storing the received responses in a database; and

retraining the reinforcement learning model based on the responses stored in the database, such that the reinforcement learning model automatically learns from user responses and continuously improves effectiveness of selection of test offers.