CPC G06F 16/951 (2019.01) [G06F 16/38 (2019.01)] | 20 Claims |
1. A system for identifying relevant information for an entity comprising:
one or more processors; and
a memory storing instructions that, when executed by the one or more processors, cause the system to:
generate a plurality of search queries comprising a seed entity and one or more entities associated with the seed entity, the generation comprising:
determining a second entity validated to be linked to the seed entity, the second entity and the seed entity forming a seed cluster;
identifying properties associated with the second entity and the seed entity;
generating a search query that is associated with a subset of the identified properties;
determining that the seed entity is associated with a third entity; and
in response to the determination that the seed entity is associated with the third entity:
determining degrees of difference between:
a first link between the seed entity and the second entity; and
a second link between the third entity and a fourth entity validated to be linked to the third entity;
determining a probability of a match between one or more types of the identified properties and a particular backend datasource against which the search query is run, selected from different backend datasources; and
creating a second search query based on the determined degrees of difference and the determined probabiltiy of the match.
|