| CPC H04L 63/04 (2013.01) [G06N 5/04 (2013.01); G06N 20/00 (2019.01); G06Q 30/0202 (2013.01)] | 20 Claims |

|
1. A method comprising:
receiving, by a processing device, a plurality of stamp representations each corresponding to a respective source document and representing visual features of content of the respective source document without containing the content of the respective source document;
generating, by the processing device and using a machine learning model, a stamp embedding space by processing the plurality of stamp representations;
receiving, by the processing device, an additional stamp representation corresponding to an additional document and having features identifying a type of the additional document; and
identifying, by the processing device, an insight pertaining to the additional document, including the type of the additional document, by comparing the additional stamp representation to the stamp embedding space.
|