US 12,393,595 B1
System and method for determining a prioritized array of associated datasets
Blake Browder, Dallas, TX (US); and Joy Figarsky, Little Rock, AR (US)
Assigned to Signet Health Corporation, North Richland Hills, TX (US)
Filed by Signet Health Corporation, North Richland Hills, TX (US)
Filed on Nov. 24, 2024, as Appl. No. 18/957,785.
Int. Cl. G06F 16/00 (2019.01); G06F 16/2457 (2019.01); G06F 16/28 (2019.01); G06Q 40/08 (2012.01)
CPC G06F 16/24575 (2019.01) [G06F 16/285 (2019.01); G06Q 40/08 (2013.01)]
16 Claims
 
1. A system for determining a prioritized array of associated datasets, wherein the system comprises:
at least a processor; and
a memory communicatively connected to the at least a processor, wherein the memory contains instructions configuring the at least a processor to:
receive a plurality of datasets, wherein the plurality of datasets comprises a plurality of instances;
apply a clustering module to the plurality of datasets, wherein the clustering module is configured to assign an individual dataset to an appropriate cluster;
apply a classification module to the plurality of datasets, wherein the classification module is trained on cluster labels of one or more clusters and configured to predict labels for new individual datasets;
generate a prioritized array, wherein generating the prioritized array comprises applying the plurality of instances of the plurality of datasets across one or more axes, wherein the one or more axes are derived from the clustering and classification of the plurality of datasets; and
perform a validation procedure on the prioritized array, wherein the validation procedure comprises:
defining required fields, wherein the required fields are defined by a prioritized array framework;
inspecting a dataset of the plurality of datasets, wherein inspecting the dataset comprises reading the dataset into a suitable data structure and performing a preliminary review to understand the structure of the dataset and identify present fields;
checking for missing values, wherein checking for missing values comprises checking for missing or null values for each of the defined required fields;
flagging incomplete or missing entries; and
generating a summary report, wherein the summary report indicates the flagged incomplete or missing entries.
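The clustering and classification steps recited in claim 1 can be illustrated with a minimal sketch. The claim does not name any particular algorithm, so the choice of k-means for clustering and a nearest-centroid rule for classification is an assumption made here purely for illustration, as are the function names (`kmeans`, `train_nearest_centroid`, `predict`) and the two-feature sample data. The sketch shows only the general pattern: cluster the datasets, train a classifier on the resulting cluster labels, then predict a label for a new individual dataset.

```python
from math import dist
from statistics import fmean


def kmeans(points, k, iterations=10):
    """Tiny k-means (illustrative assumption): one cluster label per point."""
    # Deterministic initialization: use the first k points as centroids.
    centroids = [points[i] for i in range(k)]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: dist(p, centroids[c])) for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(fmean(axis) for axis in zip(*members))
    return labels


def train_nearest_centroid(points, labels):
    """Fit a classifier on cluster labels: one mean vector per label."""
    model = {}
    for label in set(labels):
        members = [p for p, lab in zip(points, labels) if lab == label]
        model[label] = tuple(fmean(axis) for axis in zip(*members))
    return model


def predict(model, point):
    """Predict the label whose mean vector is closest to the new point."""
    return min(model, key=lambda label: dist(point, model[label]))


# Hypothetical datasets, each reduced to two numeric features.
datasets = [(1.0, 1.0), (9.0, 9.0), (1.2, 0.8), (8.8, 9.1), (0.9, 1.1)]
cluster_labels = kmeans(datasets, k=2)
classifier = train_nearest_centroid(datasets, cluster_labels)
new_label = predict(classifier, (8.5, 9.5))
```

In this sketch the classifier never sees hand-written labels; it learns only from the labels the clustering step produced, which is the relationship between the two modules that the claim describes.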
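The validation procedure recited at the end of claim 1 (defining required fields, inspecting a dataset, checking for missing values, flagging entries, and generating a summary report) can be sketched as follows. This is a hedged illustration, not the patented implementation: the field names in `REQUIRED_FIELDS`, the CSV input format, the treatment of the literal string "null" as missing, and the shape of the returned report are all assumptions introduced here.

```python
import csv
import io

# Hypothetical required fields; the claim says only that they are
# defined by a prioritized array framework.
REQUIRED_FIELDS = ["record_id", "category", "amount"]


def validate(raw_csv, required_fields):
    """Inspect one dataset and report missing required values."""
    # Inspect: read the dataset into a list of dicts and note present fields.
    reader = csv.DictReader(io.StringIO(raw_csv))
    rows = list(reader)
    present = list(reader.fieldnames or [])
    absent = [f for f in required_fields if f not in present]

    # Check each required field in each row; flag empty or null entries.
    flagged = []
    for row_number, row in enumerate(rows, start=1):
        for field in required_fields:
            value = row.get(field)
            if value is None or value.strip().lower() in ("", "null"):
                flagged.append((row_number, field))

    # Summary report indicating the flagged incomplete or missing entries.
    return {
        "rows_inspected": len(rows),
        "present_fields": present,
        "absent_required_fields": absent,
        "flagged_entries": flagged,
    }


# Hypothetical sample dataset with two incomplete rows.
SAMPLE = "record_id,category,amount\n1,A,10.5\n2,,7.0\n3,B,null\n"
report = validate(SAMPLE, REQUIRED_FIELDS)
```

For the sample above, the report flags the empty `category` in row 2 and the null `amount` in row 3. A required field that is absent from the header altogether is listed under `absent_required_fields` and is also flagged on every row.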