US 11,748,386 B2
	Method, system, and computer program product for managing source identifiers of clustered records
Sathis kumar Appukutty, San Mateo, CA (US); Sukalyan Chakraborty, Foster City, CA (US); Abhijith Ramesh Kashyap, San Carlos, CA (US); Mahesh Joshi, Hillsborough, CA (US); Urjit Anand Khadilkar, Redwood City, CA (US); Prithwiraj Mitra, Foster City, CA (US); and Nitin Singhal, San Jose, CA (US)
Assigned to Visa International Service Association, San Francisco, CA (US)
Filed by Visa International Service Association, San Francisco, CA (US)
Filed on Sep. 9, 2021, as Appl. No. 17/470,512.
Application 17/470,512 is a continuation of application No. 16/516,751, filed on Jul. 19, 2019, granted, now 11,138,238.
Claims priority of provisional application 62/700,933, filed on Jul. 20, 2018.
Prior Publication US 2021/0406286 A1, Dec. 30, 2021
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/28 (2019.01); G06F 16/2458 (2019.01); G06F 16/2457 (2019.01); G06F 16/23 (2019.01)

CPC G06F 16/285 (2019.01) [G06F 16/2358 (2019.01); G06F 16/2457 (2019.01); G06F 16/2468 (2019.01)]

15 Claims

1. A method for managing source identifiers of transaction records, comprising:

receiving, with at least one processor, a plurality of transaction records, each respective transaction record of the plurality of transaction records comprising identification data associated with a source of the respective transaction record, the source for each respective transaction record of the plurality of transaction records comprising a respective merchant, the identification data for each respective transaction record of the plurality of transaction records comprising a respective name and address key pair associated with the respective merchant;

sorting, with the at least one processor, the plurality of transaction records into a first plurality of clusters based on the identification data, each cluster of the first plurality of clusters including at least one transaction record of the plurality of transaction records;

generating, with the at least one processor, a first source identifier for each respective cluster of the first plurality of clusters based on the respective name and address key pair of respective transaction records of the plurality of transaction records sorted into the respective cluster;

receiving, with the at least one processor, update data associated with an update to the plurality of transaction records;

sorting, with the at least one processor, at least some of the plurality of transaction records into a second plurality of clusters based on the identification data and the update data, wherein a first new cluster of the second plurality of clusters has first new identification data different from the identification data associated with an existing cluster of the first plurality of clusters and a second new cluster of the second plurality of clusters has second new identification data that substantially matches the identification data associated with the existing cluster of the first plurality of clusters, wherein substantially matching comprises at least one of exactly matching, matching within a predetermined tolerance, matching based on fuzzy matching, or matching closely enough to have been sorted into a same cluster; and

determining, with the at least one processor, to assign the first source identifier of the existing cluster of the first plurality of clusters to the second new cluster of the second plurality of clusters based on at least one of a number of transacting records of the existing cluster, a number of non-transacting records of the existing cluster, a number of transacting records of the first new cluster, a number of non-transacting records of the first new cluster, a number of transacting records of the second new cluster, or a number of non-transacting records of the second new cluster.