US 12,216,648 B1
Row-order dependent dataframe workloads
Srilakshmi Chintala, Seattle, WA (US); Jianzhun Du, Kirkland, WA (US); Naresh Kumar, Santa Clara, CA (US); Srinath Shankar, Belmont, CA (US); Leonhard Franz Spiegelberg, San Francisco, CA (US); Eric Shawn Vandenberg, Saratoga, CA (US); Andong Zhan, San Mateo, CA (US); and Yun Zou, Sunnyvale, CA (US)
Assigned to Snowflake Inc., Bozeman, MT (US)
Filed by Snowflake Inc., Bozeman, MT (US)
Filed on Dec. 21, 2023, as Appl. No. 18/393,324.
Claims priority of provisional application 63/583,525, filed on Sep. 18, 2023.
Int. Cl. G06F 16/242 (2019.01); G06F 16/2453 (2019.01); G06F 16/25 (2019.01)
CPC G06F 16/244 (2019.01) [G06F 16/2453 (2019.01); G06F 16/256 (2019.01)]
30 Claims
 
1. A method comprising:
receiving, by at least one hardware processor, instructions to perform an order-dependent DataFrame operation on data stored in a first database in a cloud data platform, the instructions specified using code authored in a programming language for executing the order-dependent DataFrame operation within the cloud data platform;
analyzing the instructions to identify the order-dependent DataFrame operation;
generating an executable query corresponding to the identified order-dependent DataFrame operation;
executing the generated executable query on the data stored in the first database in the cloud data platform;
creating a row position column that generates a row position column order to make the data accessible via positional indexing;
assigning each row of the first database a unique row position value based on the row position column and the row position column order;
ordering a result from performing the order-dependent DataFrame operation based on the unique row position value assigned to each row of the first database; and
returning, to a user, the ordered result.
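
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. It assumes an in-memory SQLite database standing in for the cloud data platform, and the table name `frame_data`, the column name `row_pos`, and the helpers `load_frame` and `iloc_slice` are hypothetical names chosen for illustration. Each row receives a unique row position value, a positional slice (an order-dependent DataFrame operation) is translated into an executable query, and the result is ordered by row position before being returned.

```python
import sqlite3

ROW_POS = "row_pos"  # hypothetical name for the row position column


def load_frame(conn, rows):
    """Store rows with a unique, gap-free row position (0, 1, 2, ...)."""
    conn.execute(
        f"CREATE TABLE frame_data ({ROW_POS} INTEGER PRIMARY KEY, name TEXT, value INTEGER)"
    )
    conn.executemany(
        "INSERT INTO frame_data VALUES (?, ?, ?)",
        [(pos, name, value) for pos, (name, value) in enumerate(rows)],
    )


def iloc_slice(conn, start, stop):
    """Translate a positional slice [start, stop) into an ordered SQL query."""
    query = (
        f"SELECT name, value FROM frame_data "
        f"WHERE {ROW_POS} >= ? AND {ROW_POS} < ? "
        f"ORDER BY {ROW_POS}"
    )
    return conn.execute(query, (start, stop)).fetchall()


conn = sqlite3.connect(":memory:")
load_frame(conn, [("c", 3), ("a", 1), ("b", 2), ("d", 4)])
print(iloc_slice(conn, 1, 3))  # prints [('a', 1), ('b', 2)]
```

Without the explicit `ORDER BY` on the row position column, a SQL engine makes no guarantee about the order of returned rows, which is why the sketch carries the position as data rather than relying on storage order.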