| CPC G06V 10/88 (2022.01) [G06T 7/194 (2017.01); G06V 10/457 (2022.01); G06V 10/46 (2022.01); G06V 10/761 (2022.01); G06V 10/762 (2022.01); G06V 20/40 (2022.01); G06V 20/46 (2022.01); G06V 20/52 (2022.01); H04N 23/631 (2023.01)] | 17 Claims |

|
1. A system comprising:
a camera;
one or more memory units; and
one or more computer processors communicatively coupled to the one or more memory units and configured to perform operations comprising:
access a plurality of video frames of a video;
identify a plurality of objects from the plurality of video frames by:
generating, using a segmentation algorithm, a plurality of masks, each mask comprising a set of neighboring pixels that are determined to be related; and
extracting the plurality of objects from the plurality of video frames based on the generated plurality of masks;
generate a plurality of composite vectors for the plurality of objects by:
generating a plurality of vectors for each particular object of the plurality of objects extracted from the plurality of video frames; and
generating a particular composite vector for each particular object by combining the plurality of vectors for the particular object;
determine, using the composite vectors for the plurality of objects, a plurality of similar objects; and
display images of one or more of the plurality of similar objects in a graphical user interface.
|