US 11,704,321 B2
Techniques for relationship discovery between datasets
Robert James Oberbreckling, Boulder, CO (US); Luis E. Rivas, Denver, CO (US); Michael Malak, Denver, CO (US); and Glenn Allen Murray, Boulder, CO (US)
Assigned to Oracle International Corporation, Redwood Shores, CA (US)
Filed by Oracle International Corporation, Redwood Shores, CA (US)
Filed on Mar. 23, 2020, as Appl. No. 16/827,587.
Application 16/827,587 is a continuation of application No. 15/705,160, filed on Sep. 14, 2017, granted, now 10,650,000.
Claims priority of provisional application 62/395,349, filed on Sep. 15, 2016.
Prior Publication US 2020/0242111 A1, Jul. 30, 2020
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/2455 (2019.01); G06F 16/22 (2019.01); G06F 16/248 (2019.01); G06F 16/25 (2019.01)
CPC G06F 16/2456 (2019.01) [G06F 16/221 (2019.01); G06F 16/248 (2019.01); G06F 16/254 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method comprising, at a computer system:
accessing a first plurality of columns in a first dataset stored in a first data source;
accessing a second plurality of columns in a second dataset stored in a second data source;
identifying a plurality of column pairs between the first dataset and the second dataset, wherein each column pair in the plurality of column pairs includes a different one of the first plurality of columns and a different one of the second plurality of columns, and wherein all possible pairs of column pairs are identified between the first dataset and the second dataset;
determining one or more column pairs from the plurality of identified column pairs to exclude;
excluding at least one column pair from the one or more determined column pairs; and
for each of the one or more column pairs remaining after the excluding step:
based on a type of join specified via a graphical interface, computing a plurality of scores for a column pair, each of the plurality of scores computed based on a different one of a plurality of scoring functions, the score indicating a measure for joining columns in the column pair.