CPC G06F 16/285 (2019.01) [G06F 16/24573 (2019.01)] | 20 Claims |
1. A computer-implemented method for generating cluster templates used for creating extraction models, comprising:
receiving a plurality of training files associated with a selected class;
performing an automated visual analysis on each of the plurality of training files;
performing an automated contextual analysis on each of the plurality of training files;
performing a first clustering of the plurality of training files into a first plurality of clusters using results from the automated visual analysis;
performing a second clustering of one of the first plurality of clusters into a second plurality of clusters using results from the automated contextual analysis; and
generating cluster templates for the first and second plurality of clusters, wherein
the first and the second plurality of clusters are clusters of the plurality of training files.
|