US 12,450,841 B2
Embeddings representing visual augmentations
Zhenpeng Zhou, Newark, CA (US); Patrick Poirson, Gilbert, AZ (US); Maksim Gusarov, Marina del Rey, CA (US); Chen Wang, Great Neck, NY (US); and Oleg Tovstyi, Los Angeles, CA (US)
Assigned to Snap Inc., Santa Monica, CA (US)
Filed by Snap Inc., Santa Monica, CA (US)
Filed on Apr. 20, 2023, as Appl. No. 18/304,078.
Prior Publication US 2024/0355063 A1, Oct. 24, 2024
Int. Cl. G06T 19/00 (2011.01); G06T 1/00 (2006.01); G06V 10/74 (2022.01); H04N 5/262 (2006.01)
CPC G06T 19/006 (2013.01) [G06T 1/0021 (2013.01); G06V 10/761 (2022.01); H04N 5/2621 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A system comprising:
at least one processor;
at least one memory component storing instructions that, when executed by the at least one processor, cause the at least one processor to perform operations comprising:
accessing an input video item that includes a target visual augmentation;
generating, by a machine learning model, an embedding of the input video item, the machine learning model being trained, in an unsupervised training phase, to minimize loss between training video representations generated within each of a plurality of training sets, each training set comprising a plurality of training video items having different video content and that each include the same predefined visual augmentation; and
mapping, based on the generation of the embedding of the input video item, the target visual augmentation to an augmentation identifier.