CPC G06Q 20/047 (2020.05) [G06V 30/40 (2022.01); G06F 18/285 (2023.01); G06V 30/10 (2022.01); G06V 30/19113 (2022.01); G06V 2201/09 (2022.01)] | 20 Claims |
1. A system, comprising:
one or more servers configured to:
receive, from a consumer device, image data associated with a receipt;
cause execution of a first image cleanup with respect to the image data associated with the receipt;
determine, based on the first image cleanup, a merchant logo associated with a merchant;
determine, based on the merchant logo, a receipt structure associated with the merchant;
cause execution of a second image cleanup with respect to the image data associated with the receipt;
extract, based on the second image cleanup, one or more portions of receipt text data from the image data;
determine a text match score based on a comparison of the one or more portions of receipt text data and one or more known keywords associated with the receipt structure related to the merchant;
determine whether the text match score satisfies the text match score threshold; and
in response to determining that the text match score satisfies the text match score threshold,
generate structured receipt data based on the one or more portions of receipt text data.
|