US 11,727,013 B2
Optimized processing of data in different formats
Tyler Arthur Akidau, Seattle, WA (US); Thierry Cruanes, San Mateo, CA (US); Istvan Cseri, Seattle, WA (US); Benoit Dageville, San Mateo, CA (US); Tyler Jones, Redwood City, CA (US); and Dinesh Chandrakant Kulkarni, Sammamish, WA (US)
Assigned to Snowflake Inc., Bozeman, MT (US)
Filed by Snowflake Inc., Bozeman, MT (US)
Filed on Sep. 7, 2022, as Appl. No. 17/930,150.
Application 17/930,150 is a continuation of application No. 17/455,594, filed on Nov. 18, 2021, granted, now 11,461,335.
Application 17/455,594 is a continuation in part of application No. 17/386,258, filed on Jul. 27, 2021, granted, now 11,250,006.
Application 17/386,258 is a continuation of application No. 17/226,423, filed on Apr. 9, 2021, granted, now 11,100,111.
Prior Publication US 2022/0414106 A1, Dec. 29, 2022
Int. Cl. G06F 16/245 (2019.01); G06F 16/2455 (2019.01); G06F 16/2453 (2019.01)
CPC G06F 16/24568 (2019.01) [G06F 16/24544 (2019.01)] 18 Claims
OG exemplary drawing
 
1. A method comprising:
storing a first set of data in a first format in a first cloud storage location;
storing a second set of data in a second format in a second cloud storage location;
classifying a first subset of the first set of data in the first format as high-value data and classifying a second subset of the first set of data as low-value data; and
ingesting a copy of the high-value data from the first cloud storage location into the second cloud storage location in the second format, wherein the first subset of the first set of data in the first format is maintained and not deleted in the first cloud storage location in response to ingesting the copy of the high-value data;
providing an interface for accessing the first and second sets of data;
receiving, via the interface, a first query referencing the first and second sets of data;
determining that the first query references the first subset of the first data;
executing the first query using the first subset of data in the second cloud storage location in the second format and the second set of data;
receiving, via the interface, a second query referencing the first and second sets of data, determining that the second query references a second subset of the first set of data not ingested into the second cloud storage location;
converting the second subset of the first set of data from the first format into a common format;
converting the second set of data from the second format into the common format;
joining the second subset of the first set of data in the common format and the second set of data in the common format to generate joined data; and
executing the second query based on the joined data.