CPC H04L 51/52 (2022.05) [G06F 3/0484 (2013.01); G06Q 50/01 (2013.01); G06F 3/04817 (2013.01); G06F 3/0482 (2013.01); H04L 67/01 (2022.05); H04L 67/306 (2013.01)] | 16 Claims |
1. A digital recipe clipping engine comprising:
at least one processor and a memory operably coupled to the at least one processor and configured to store instructions invoked by the at least one processor;
instructions that, when executed, cause the at least one processor to implement:
an input interface configured to receive digital recipe data comprising a plurality of data items,
an importer configured to:
parse the digital recipe data,
determine if the digital recipe data is in a structured format by associating an XML schema of the digital recipe data with a known schema of one or more known schemas, and
in response to the digital recipe data being determined to be in a structured format, import content of the structured digital recipe data according to the structured format, and
in response to the digital recipe data being determined to be in a non-structured format, convert content of the unstructured digital recipe data to a structured format by:
identifying an item of the plurality of data items;
for each item of the plurality of data items identified, determining the data item is one of expected and unexpected:
for each data item determined to be expected, assigning the data item to a known data field using a content of the data item and a database of expected data; and
for each data item determined to be unexpected, conducting a best guess assignment, wherein the best guess assignment comprises assigning the data item to a known data field based on the contents of the data item and a closeness of the data item to items in the database of expected data.
|
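The importer logic recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation, and it rests on several assumptions the claim does not state: schema association is reduced to matching the XML root tag against a hypothetical `KNOWN_SCHEMAS` table, the database of expected data is a small in-memory dictionary (`EXPECTED_DATA`), unstructured input is taken to be one data item per line, an item counts as "expected" when it matches a database entry exactly, and "closeness" is measured with the similarity ratio from Python's standard `difflib` module. All function and field names are illustrative.

```python
import difflib
import xml.etree.ElementTree as ET

# Hypothetical known schemas: root tag -> {child tag: known data field}.
KNOWN_SCHEMAS = {
    "recipe": {"name": "title", "ingredient": "ingredients", "step": "instructions"},
}

# Hypothetical database of expected data: known data field -> expected values.
EXPECTED_DATA = {
    "ingredients": ["flour", "sugar", "butter", "eggs", "milk"],
    "units": ["cup", "tablespoon", "teaspoon", "gram"],
    "instructions": ["preheat oven", "mix well", "bake", "stir"],
}


def match_schema(raw):
    """Return (root, field_map) if the input parses as XML with a known root tag."""
    try:
        root = ET.fromstring(raw)
    except ET.ParseError:
        return None  # not well-formed XML, so treated as non-structured
    field_map = KNOWN_SCHEMAS.get(root.tag)
    return (root, field_map) if field_map is not None else None


def import_structured(root, field_map):
    """Import content according to the matched structured format."""
    record = {}
    for child in root:
        field = field_map.get(child.tag)
        if field and child.text:
            record.setdefault(field, []).append(child.text.strip())
    return record


def assign_item(item):
    """Assign one data item to a known data field; return (field, how)."""
    text = item.strip().lower()
    # Expected item: its content appears verbatim in the database.
    for field, values in EXPECTED_DATA.items():
        if text in values:
            return field, "expected"
    # Unexpected item: best guess by closeness to items in the database.
    scored = [
        (difflib.SequenceMatcher(None, text, value).ratio(), field)
        for field, values in EXPECTED_DATA.items()
        for value in values
    ]
    return max(scored)[1], "best_guess"


def import_recipe(raw):
    """Import structured data directly; otherwise convert item by item."""
    matched = match_schema(raw)
    if matched is not None:
        return import_structured(*matched)
    record = {}
    for line in raw.splitlines():
        if line.strip():
            field, _ = assign_item(line)
            record.setdefault(field, []).append(line.strip())
    return record
```

In this sketch, calling `import_recipe` on an XML string whose root is `recipe` takes the structured path, while plain text falls through to per-item assignment, where a misspelled or pluralized item is still placed in the field of its nearest database entry. A production system would likely compare against a real XML schema definition and use a closeness threshold below which an item is left unassigned.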