CPC G06V 10/30 (2022.01) [G06T 5/002 (2013.01); G06V 30/418 (2022.01); G06T 2207/20081 (2013.01); G06T 2207/30176 (2013.01)] | 20 Claims |
1. A system for removing noise artifacts from documents, comprising:
a memory operable to store a training clean image of a document, wherein the training clean image is free of noise artifacts;
a processor, operably coupled with the memory, and configured to:
receive an image of the document, wherein the image contains a noise artifact at least partially obstructing a portion of the image;
extract a first set of features from the image, wherein the first set of features represents at least one of shapes, symbols, numbers, and text in the image;
identify noise artifact features from the first set of features, wherein the noise artifact features represent pixel values of the noise artifact;
generate a second set of features by removing the noise artifact features from the first set of features;
generate a test clean image of the document based at least in part upon the second set of features as an input;
compare a portion of the test clean image that previously displayed the noise artifact with a counterpart portion of the training clean image to determine whether the noise artifact is removed from the test clean image;
determine whether the portion of the test clean image that previously displayed the noise artifact corresponds to the counterpart portion of the training clean image; and
in response to determining that the portion of the test clean image that previously displayed the noise artifact corresponds to the counterpart portion of the training clean image, output the test clean image of the document.
|