CPC G06F 16/2443 (2019.01) [G06F 16/25 (2019.01); G06F 16/313 (2019.01); G06F 16/907 (2019.01)] | 18 Claims |
1. A computer-implemented method comprising:
identifying a relationship between a first source type identified in a search query and a second source type excluded from being identified in the search query, the relationship identified using a metadata catalog that includes indications of related source types;
identifying field set pairs from a first data set associated with the first source type and a second data set associated with the second source type, wherein each field set pair includes one field set associated with the first source type and another field set associated with the second source type;
for each field set pair, determining an extent of similarity between the corresponding field sets by analyzing at least field names or field values associated with the corresponding field sets;
identifying at least one pair of related field sets based on the corresponding extent of similarities exceeding a similarity threshold or based on the corresponding extent of similarities being a highest set of similarity scores; and
providing an indication of the at least one pair of related field sets.
|