US 12,008,001 B2
Overlap queries on a distributed database
Matthew J. Glickman, Larchmont, NY (US); Orestis Kostakis, Redmond, WA (US); and Justin Langseth, Kailua, HI (US)
Assigned to Snowflake Inc., Bozeman, MT (US)
Filed by Snowflake Inc., Bozeman, MT (US)
Filed on May 27, 2022, as Appl. No. 17/804,434.
Prior Publication US 2023/0385284 A1, Nov. 30, 2023
Int. Cl. G06F 16/245 (2019.01); G06F 16/24 (2019.01); G06F 16/242 (2019.01); G06F 16/2455 (2019.01)
CPC G06F 16/24568 (2019.01) [G06F 16/244 (2019.01); G06F 16/2456 (2019.01); G06F 16/24564 (2019.01)] 30 Claims
OG exemplary drawing
 
1. A method comprising:
identifying a user dataset on a database system;
generating a similarity score dataset that indicates a similarity between the user dataset and a plurality of datasets of other users of the database system;
generating, using the one or more processors of the machine, a plurality of overlap queries that are configured to output overlap datasets between the user dataset and one or more of the plurality of datasets based on the similarity score dataset;
generating a results dataset in an automatic manner without input from a user by applying one or more of the plurality of overlap queries to a joined dataset comprising data fields from the user dataset and one of the plurality of datasets of the other users on the database system, the generating of the results dataset including identifying one or more solvable problems based on the plurality of overlap queries;
executing the one or more of the plurality of overlap queries for overlap analysis, the overlap analysis identifying one or more unresolved solvable problems based on the one or more solvable problems;
providing the results dataset comprising the one or more unresolved solvable problems to a computing device of the user; and
providing one or more recommended solutions to the one or more unresolved solvable problems on the computing device of the user.