| CPC G06Q 30/0621 (2013.01) [G06N 5/022 (2013.01); G06N 5/025 (2013.01)] | 17 Claims |

|
1. A computer-implemented method, comprising:
accessing a set of product formulas, each product formula including a set of ingredient tuples;
generating a directed graph from the set of product formulas, the directed graph including a node for each ingredient of the sets of ingredient tuples of the set of product formulas;
generating a weighted graph from the directed graph, the weighted graph having a weight assigned to each edge in the directed graph;
modifying random walk models by incorporating the weight assigned to each edge into the random walk model;
vectorizing the set of product formulas based on ingredients within the product formulas;
feeding the random walks from the random walk models into a deep learning approach to determine a set of embeddings comprising ingredient embeddings and a set of ingredient vectors from the vectorizing, wherein the deep learning approach is run on data including ingredient recommendations; and substitute ingredient recommendation;
generating, based on the weighted graph, an embedding module;
deploying the random walk models on the weighted graph;
building neural models for two or more positive formulas in the weighted graph by: extracting, using the random walk models, a subgraph with depth d; extracting n biased walks in the subgraph; and building a neural model for each entity in the weighted graph, wherein positive formulas are used to generate a weight of edges to provide the bias for the biased walks on the weighted graph, wherein positive examples are formulas that have a threshold level of market success;
training, using a bias of the biased walks from the weight of the edges, a sequence-to-sequence neural network generated using a pairwise ranking loss functions derived from the neural models;
augmenting the sequence-to-sequence neural network with the set of embeddings for the weighted graph as a sigmoid layer in the sequence-to-sequence neural network,
wherein each node is represented with low-dimensional numerical formula vectors;
deploying the embedding module to generate a new product formula by passing the set of ingredient embeddings through the sequence-to-sequence neural network using one or more of the neural models; and
identifying, using the sequence-to-sequence neural network, a new ingredient tuple for the new product formula.
|