US 12,007,992 B2
Serverless data lake indexing subsystem and application programming interface
Rahul Potharaju, Redmond, WA (US); Wentao Wu, Bellevue, WA (US); Terry Y. Kim, Bellevue, WA (US); Lev Novik, Bellevue, WA (US); and Apoorve Dave, Bellevue, WA (US)
Assigned to MICROSOFT TECHNOLOGY LICENSING, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on Aug. 10, 2022, as Appl. No. 17/818,878.
Application 17/818,878 is a continuation of application No. 16/989,339, filed on Aug. 10, 2020, granted, now 11,449,508.
Claims priority of provisional application 63/020,356, filed on May 5, 2020.
Prior Publication US 2022/0382756 A1, Dec. 1, 2022
Int. Cl. G06F 16/2453 (2019.01); G06F 16/22 (2019.01); G06F 16/25 (2019.01)
CPC G06F 16/24542 (2019.01) [G06F 16/2272 (2019.01); G06F 16/254 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A query processing workload optimization system comprising:
one or more processors; and
one or more memory devices accessible to the one or more processors, the one or more memory devices storing program code for execution by the one or more processors, the program code including:
a query processor that:
receives a query;
generates a query plan for the query;
searches a data lake for built indexes stored at a predetermined location that conforms to an index specification, each built index including respective metadata that conforms to the index specification;
determines, based on the respective index metadata for each built index found, a first built index to optimize the query plan, the first built index comprising first index metadata of the respective index metadata describing a lineage of the first built index, the lineage comprising an operation log that indicates operations that have taken place on the built index and usable by the query processor to determine a data source corresponding to the first built index has been updated;
optimizes the query plan to use the first built index; and
executes the optimized query plan to provide a query result.