US 12,405,924 B2
Managed tables for data lakes
Victor Sergeyevich Agababov, Seattle, WA (US); Shuang Guan, Sunnyvale, CA (US); Thibaud Hottelier, Seattle, WA (US); Anoop Kochummen Johnson, Fremont, CA (US); Justin Levandoski, Seattle, WA (US); Bigang Li, Redmond, WA (US); and Yuri Volobuev, Walnut Creek, CA (US)
Assigned to Google LLC, Mountain View, CA (US)
Filed by Google LLC, Mountain View, CA (US)
Filed on Nov. 14, 2023, as Appl. No. 18/389,337.
Claims priority of provisional application 63/535,811, filed on Aug. 31, 2023.
Prior Publication US 2025/0077478 A1, Mar. 6, 2025
Int. Cl. G06F 16/23 (2019.01); G06F 12/02 (2006.01); G06F 16/18 (2019.01); G06F 16/22 (2019.01); G06F 16/28 (2019.01)
CPC G06F 16/1805 (2019.01) [G06F 12/0253 (2013.01); G06F 16/221 (2019.01); G06F 16/2358 (2019.01); G06F 16/2365 (2019.01); G06F 16/2379 (2019.01); G06F 16/283 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method for processing queries, comprising:
receiving, by one or more processors, a request from a query engine to write one or more tuples;
writing, by the one or more processors, the one or more tuples to a write-optimized storage in a row-oriented format in a distributed file system that supports file appends;
converting, by the one or more processors, the one or more tuples to one or more data files in a columnar-oriented format compatible with the query engine;
storing, by the one or more processors, the one or more data files in a read-optimized cloud storage in the columnar-oriented format compatible with the query engine; and
committing, by the one or more processors, the write as an addition to a table transaction log stored in the distributed file system by writing a row-level addition to the table transaction log.