US 11,682,020 B2
System and method to replay suspicious activity detection pipeline in risk networks
Srinivasan S. Muthuswamy, Bangalore (IN); Subhendu Das, Chapel Hill, NC (US); Mukesh Kumar, Bangalore (IN); and Willie Robert Patten, Jr., Hurdle Mills, NC (US)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by International Business Machines Corporation, Armonk, NY (US)
Filed on Jul. 13, 2021, as Appl. No. 17/374,485.
Prior Publication US 2023/0019453 A1, Jan. 19, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G06Q 20/40 (2012.01); G06F 11/30 (2006.01); G06F 11/32 (2006.01)
CPC G06Q 20/4016 (2013.01) [G06F 11/3072 (2013.01); G06F 11/327 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method for rerunning computer-generated alert reports comprising:
identifying an initial computer-generated alert report to be rerun;
collecting information on the initial computer-generated alert report;
gathering information on the configuration of an initial computer-generated data analytics pipeline that generated the initial computer-generated alert report,
wherein the initial computer-generated data analytics pipeline comprises one or more initial computer-implemented data analytics models and at least one initial data preparation group consisting of initial electronic data filters, initial electronic data transform functions, and combinations thereof that process electronic input data to create an initial feature set for use by the one or more initial computer-implemented data analytics models, and wherein the initial electronic data filters process electronic data to obtain desired electronic data in a desired format, the initial electronic data transform functions transform electronic data to generate one or more initial feature sets, and the initial computer-implemented data analytics models comprise initial unsupervised machine learning models, and
further wherein the initial computer-generated data analytics pipeline comprises the one or more initial computer-implemented data analytics models and one or more of the initial data preparation groups configured to create a flow of initial tasks that receives electronic input data as input to the initial computer-generated data analytics pipeline and processes the electronic input data to generate the initial computer-generated alert report;
gathering electronic data used to generate the initial computer-generated alert report;
creating a recreated computer-generated data analytics pipeline based upon the information gathered on the configuration of the initial computer-generated data analytics pipeline that generated the initial computer-generated alert report,
wherein the recreated computer-generated data analytics pipeline comprises one or more recreated computer-implemented data analytics models and at least one recreated data preparation group consisting of recreated electronic data filters, recreated electronic data transform functions, and combinations thereof that process electronic input data to create a recreated feature set for use by the one or more recreated computer-implemented data analytics models, and wherein the recreated electronic data filters process the gathered electronic data to obtain desired recreated electronic data in a desired recreated format, the recreated electronic data transform functions transform electronic data to generate one or more recreated feature sets, and the recreated computer-implemented data analytics models comprise recreated unsupervised machine learning models, and
further wherein the recreated computer-generated data analytics pipeline comprises the recreated computer-implemented data analytics models and one or more of the recreated data preparation groups configured to create a recreated flow of recreated tasks that receives the gathered electronic input data as input to the recreated computer-generated data analytics pipeline and processes the gathered electronic input data to create a rerun computer-generated alert report; and
running the recreated computer-generated data analytics pipeline using the gathered electronic data to create the rerun computer-generated alert report.