CPC G06V 20/625 (2022.01) [G06V 10/95 (2022.01); G06V 20/58 (2022.01); G06V 30/153 (2022.01); G06V 30/18 (2022.01); G06V 30/1916 (2022.01); G06V 30/19147 (2022.01)] | 30 Claims |
1. A machine-based method of recognizing license plates, comprising:
dividing an image or video frame comprising a license plate in the image or video frame into a plurality of image patches, wherein the image or video frame is divided horizontally and vertically to obtain the image patches, and wherein at least one of the image patches comprises a portion of a character of a license plate number of the license plate;
determining a positional vector for each of the image patches, wherein the positional vector represents a spatial position of each of the image patches in the image or video frame;
adding the positional vector to each of the image patches and inputting the image patches and their associated positional vectors to a transformer encoder of a text-adapted vision transformer run on one or more devices; and
obtaining a prediction, outputted by the text-adapted vision transformer, concerning the license plate number of the license plate.
|