US 12,216,685 B2
Systems and methods for pattern-based multi-stage deterministic data classification
Zi Cheng Feng, Woodhaven, NY (US)
Assigned to The Travelers Indemnity Company, Hartford, CT (US)
Filed by The Travelers Indemnity Company, Hartford, CT (US)
Filed on Dec. 22, 2023, as Appl. No. 18/394,744.
Application 18/394,744 is a continuation of application No. 17/721,705, filed on Apr. 15, 2022, granted, now 11,893,045.
Prior Publication US 2024/0126788 A1, Apr. 18, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/00 (2019.01); G06F 16/28 (2019.01); G06V 30/19 (2022.01)
CPC G06F 16/285 (2019.01) [G06V 30/19 (2022.01)] 20 Claims
OG exemplary drawing
 
1. A method for pattern-based multi-stage deterministic data classification, comprising:
receiving, by at least one electronic processing device from a plurality of electronic processing devices of a server, an application for an underwriting product, the application comprising at least one data value and at least one attached file;
converting, utilizing an Optical Character Recognition (OCR) algorithm stored in a non-transitory data storage device that is in communication with the at least one electronic processing device, at least one portion of the at least one attached file into at least one searchable text character, thereby defining a searchable version of the at least one attached file;
searching, by the at least one electronic processing device and utilizing a listing of keywords stored in the non-transitory data storage device, the listing of keywords representing required data elements for a consolidated set of strategic classification rules, the searchable version of the at least one attached file;
identifying, by the at least one electronic processing device and based on the searching, and for each required data element for the consolidated set of strategic classification rules, whether a value for the required data element exists in the at least one attached file;
categorizing, by the at least one electronic processing device and based on the identifying of whether the values for the required data elements exist in the at least one attached file, the application into one of a plurality of pre-defined data availability categories;
selecting, by the at least one electronic processing device and based on the categorized one of the plurality of pre-defined data availability categories for the application and utilizing a mapping stored in the non-transitory data storage device, the mapping comprising a mapping of the plurality of data availability categories to a plurality of signature analysis routines, one of the plurality of signature analysis routines;
identifying, by the at least one electronic processing device and by an execution of the selected one of the plurality of signature analysis routines, values for the required data elements for the consolidated set of strategic classification rules;
evaluating, by the at least one electronic processing device and based on the identified values for the required data elements for the consolidated set of strategic classification rules, the consolidated set of strategic classification rules;
identifying, by the at least one electronic processing device and based on the evaluating, a numeric value for a classification; and
automatically routing, by the at least one electronic processing device, the application for the underwriting product to an address assigned to the identified classification.