| CPC G06T 19/006 (2013.01) [G06T 1/0021 (2013.01); G06V 10/761 (2022.01); H04N 5/2621 (2013.01)] | 20 Claims |

|
1. A system comprising:
at least one processor;
at least one memory component storing instructions that, when executed by the at least one processor, cause the at least one processor to perform operations comprising:
accessing an input video item that includes a target visual augmentation;
generating, by a machine learning model, an embedding of the input video item, the machine learning model being trained, in an unsupervised training phase, to minimize loss between training video representations generated within each of a plurality of training sets, each training set comprising a plurality of training video items having different video content and that each include the same predefined visual augmentation; and
mapping, based on the generation of the embedding of the input video item, the target visual augmentation to an augmentation identifier.
|