US 11,775,864 B2
Feature management platform
Frank Wisniewski, San Francisco, CA (US); Abhishek Jain, Mountain View, CA (US); Caio Vinicius Soares, Redwood City, CA (US); Tristan Cooper Baker, San Diego, CA (US); and Joseph Brian Cessna, San Diego, CA (US)
Assigned to INTUIT, INC., Mountain View, CA (US)
Filed by INTUIT INC., Mountain View, CA (US)
Filed on May 29, 2020, as Appl. No. 16/887,731.
Prior Publication US 2021/0374600 A1, Dec. 2, 2021
Int. Cl. G06F 16/25 (2019.01); G06N 20/00 (2019.01)
CPC G06N 20/00 (2019.01) [G06F 16/254 (2019.01)]
20 Claims
 
1. A method, comprising:
receiving, at a feature management platform from a first computing device, a processing artifact that defines a feature associated with a data source and a transform;
generating, based on the processing artifact, a processing job configured to retrieve event data from the data source;
initiating the processing job that includes:
retrieving the event data from the data source;
applying the transform to the event data to generate a set of feature values; and
encapsulating the set of feature values within a feature vector for the feature to store in a feature store of the feature management platform;
storing the feature vector in the feature store for a certain amount of time, wherein the feature store is a fast-retrieval database, and wherein the feature management platform is associated with a dual storage system comprising the fast-retrieval database for storing features for the certain amount of time and a separate training data database for storing respective features for longer than the certain amount of time;
storing feature metadata associated with the feature vector in a feature registry;
transmitting the feature vector representing the set of feature values to a model hosted on the first computing device;
receiving, at the feature management platform, from the first computing device, a prediction generated by the model;
transmitting, by the feature management platform, the prediction to the feature store of the feature management platform; and
upon receiving a request from a second computing device for the feature within the certain amount of time after the storing of the feature vector in the feature store:
determining that the feature vector for the feature is stored in the fast-retrieval database of the dual storage system based on locating the feature metadata in the feature registry;
retrieving the feature vector from the fast-retrieval database based on the locating of the feature metadata in the feature registry; and
transmitting the feature vector to the second computing device from the fast-retrieval database without repeating, in response to the request, the retrieving of the event data, the applying of the transform, or the encapsulating of the set of feature values that were earlier performed to create the feature vector.
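
The flow recited in claim 1 can be illustrated with a minimal in-memory sketch. It is not the patented implementation: every name in it (ProcessingArtifact, FeatureVector, FeaturePlatform, run_processing_job, request_feature, ttl_seconds) is hypothetical, plain dictionaries stand in for the fast-retrieval database, the training data database, and the feature registry, and writing each vector to both stores at the same moment is an assumption the claim does not spell out.

```python
import time
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Optional, Tuple


@dataclass
class ProcessingArtifact:
    """Hypothetical artifact: a feature name, a data source, and a transform."""
    feature_name: str
    data_source: Callable[[], List[Any]]
    transform: Callable[[List[Any]], List[float]]


@dataclass
class FeatureVector:
    """Encapsulates the set of feature values computed for one feature."""
    feature_name: str
    values: List[float]


class FeaturePlatform:
    """Toy stand-in for the platform; all storage is in-memory dictionaries."""

    def __init__(self, ttl_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.ttl_seconds = ttl_seconds  # the "certain amount of time"
        self.clock = clock
        # Fast-retrieval store: feature name -> (vector, time stored).
        self.fast_store: Dict[str, Tuple[FeatureVector, float]] = {}
        # Longer-lived training store (assumed to be written at the same time).
        self.training_store: Dict[str, List[FeatureVector]] = {}
        # Feature registry: feature name -> metadata about the stored vector.
        self.registry: Dict[str, Dict[str, Any]] = {}
        # Predictions sent back by the model host, keyed by feature name.
        self.predictions: Dict[str, Any] = {}

    def run_processing_job(self, artifact: ProcessingArtifact) -> FeatureVector:
        """Retrieve event data, apply the transform, encapsulate, and store."""
        events = artifact.data_source()
        values = artifact.transform(events)
        vector = FeatureVector(artifact.feature_name, values)
        stored_at = self.clock()
        self.fast_store[artifact.feature_name] = (vector, stored_at)
        self.training_store.setdefault(artifact.feature_name, []).append(vector)
        self.registry[artifact.feature_name] = {
            "location": "fast_store",
            "stored_at": stored_at,
        }
        return vector

    def record_prediction(self, feature_name: str, prediction: Any) -> None:
        """Keep a prediction that the model host returned for this feature."""
        self.predictions[feature_name] = prediction

    def request_feature(self, feature_name: str) -> Optional[FeatureVector]:
        """Serve a stored vector without re-running the processing job.

        Returns None when the registry has no entry for the feature or when
        the retention window has elapsed.
        """
        metadata = self.registry.get(feature_name)
        if metadata is None:
            return None
        if self.clock() - metadata["stored_at"] > self.ttl_seconds:
            return None
        vector, _ = self.fast_store[feature_name]
        return vector
```

In this sketch a second caller that invokes request_feature within ttl_seconds of run_processing_job receives the stored vector after a registry lookup, and the data source and transform are not invoked again, which corresponds to the final limitation of the claim. The injectable clock is only a convenience for exercising the retention window deterministically.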