US 12,072,880 B2
Scan parsing
Prashanth Menon, Jersey City, NJ (US); Alexander Behm, Lafayette, CA (US); and Sriram Krishnamurthy, San Francisco, CA (US)
Assigned to Databricks, Inc., San Francisco, CA (US)
Filed by Databricks, Inc., San Francisco, CA (US)
Filed on Aug. 22, 2022, as Appl. No. 17/892,376.
Prior Publication US 2024/0061839 A1, Feb. 22, 2024
Int. Cl. G06F 9/00 (2018.01); G06F 16/2453 (2019.01); G06F 16/28 (2019.01)
CPC G06F 16/24542 (2019.01) [G06F 16/285 (2019.01)]
19 Claims
 
1. A system, comprising:
a memory;
one or more processors configured to:
receive an indication that a first file is to be processed;
store an order of processing engines, the order specifying at least a first processing engine and a second processing engine after the first processing engine, wherein the second processing engine is equipped to handle at least a particular type of parsing error, wherein a parsing speed of the first processing engine is faster than a parsing speed of the second processing engine, and wherein the first processing engine has a higher susceptibility to the particular type of parsing error than the second processing engine;
determine to begin processing the first file using a first processing engine based at least in part on one or more predefined heuristics;
indicate to process the first file using the first processing engine;
determine whether the particular type of parsing error in processing the first file using the first processing engine has been detected;
in response to determining that the particular error has been detected, indicate to stop processing the first file using the first processing engine and indicate to continue processing using a second processing engine; and
store in memory information obtained based on processing the first file by one or more of the first processing engine and the second processing engine.
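
The control flow recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. Every name in it (fast_engine, robust_engine, FallbackParseError, choose_start, ParseResult, process_file) is hypothetical, and the sketch assumes a simple setting: each input line holds exactly one comma-separated record, the particular type of parsing error is a quoted field that the fast engine cannot handle, and the predefined heuristic only inspects the first line.

```python
import csv
from dataclasses import dataclass, field
from typing import Callable, Iterator, List, Sequence

Row = List[str]
Engine = Callable[[Sequence[str], int], Iterator[Row]]


class FallbackParseError(Exception):
    """Hypothetical marker for the one error type that triggers a fallback."""


def fast_engine(lines: Sequence[str], start: int) -> Iterator[Row]:
    """Assumed fast, strict engine: plain comma split, no quoting support."""
    for line in lines[start:]:
        if '"' in line:
            raise FallbackParseError(line)
        yield line.split(",")


def robust_engine(lines: Sequence[str], start: int) -> Iterator[Row]:
    """Assumed slower engine: the stdlib csv reader, which handles quotes."""
    yield from csv.reader(lines[start:])


# Stored order of engines: the faster, more error-prone engine comes first.
ENGINE_ORDER: List[Engine] = [fast_engine, robust_engine]


def choose_start(lines: Sequence[str]) -> int:
    """Illustrative heuristic: skip the fast engine if the first line is quoted."""
    return 1 if lines and '"' in lines[0] else 0


@dataclass
class ParseResult:
    rows: List[Row] = field(default_factory=list)
    engines_used: List[str] = field(default_factory=list)


def process_file(lines: Sequence[str]) -> ParseResult:
    result = ParseResult()
    for engine in ENGINE_ORDER[choose_start(lines):]:
        result.engines_used.append(engine.__name__)
        try:
            # Resume at the first record that has not been parsed yet
            # (valid here because one line yields exactly one row).
            for row in engine(lines, len(result.rows)):
                result.rows.append(row)
            break  # this engine finished the file
        except FallbackParseError:
            continue  # stop this engine, continue with the next one
    else:
        raise FallbackParseError("no engine in the order could parse the file")
    return result
```

For example, process_file(['a,b', '"e,f",g']) parses the first line with fast_engine, stops at the quoted field, and lets robust_engine continue from the second line. The rows kept from both engines are returned together in the ParseResult, which stands in for the information stored in memory.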