CPC G06F 40/186 (2020.01) [G06F 40/284 (2020.01)] | 17 Claims |
1. A method for converting documents to fillable document templates, the method comprising:
obtaining, by a processing device, a document having a plurality of tokens in corresponding regions in the document, wherein the regions include text;
identifying, via a machine learned model, a token state for each token of the plurality of tokens, wherein each token state indicates whether a corresponding token is a static token to be maintained or a dynamic token to be removed, wherein the machine learned model is trained using dynamic token state indicators assigned to words in a set of similar training documents based on text differences within the set of similar training documents; and
generating a fillable document template corresponding with the document, wherein the fillable document template is generated by, for each dynamic token of the document, removing the dynamic token and replacing the dynamic token with a fillable field that is unfilled enabling text input to be provided into the fillable field.
|