| CPC G06V 40/40 (2022.01) [G06F 18/21 (2023.01); G06F 18/22 (2023.01); G06V 20/49 (2022.01); G06V 40/168 (2022.01); G06V 40/70 (2022.01); G10L 17/22 (2013.01)] | 22 Claims |

|
1. A computer-implemented method comprising:
obtaining, by a computer, an audiovisual data sample containing audiovisual data;
applying, by the computer, a machine-learning architecture to the audiovisual data to generate a similarity score using a biometric embedding extracted from the audiovisual data, generate a lip-sync score using one or more lip-sync embeddings extracted from the audiovisual data, and generate a deepfake score using a speaker spoofprint embedding and a facial spoofprint embedding extracted from the audiovisual data; and
generating, by the computer, a final output score indicating a likelihood that the audiovisual data is genuine based upon algorithmically combining the similarity score, the lip-sync score, and the deepfake score.
|
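The final step of claim 1 can be illustrated with a minimal sketch. The claim says only that the three scores are "algorithmically combined"; it does not specify how. The weighted logistic fusion below is therefore an assumption chosen for illustration, not the claimed method. The names `Scores`, `cosine_similarity`, and `final_output_score`, the [0, 1] score ranges, the score polarities, and the default weights are all hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Scores:
    """The three intermediate scores named in claim 1 (ranges are assumed)."""
    similarity: float  # assumed [0, 1]; higher = closer biometric match
    lip_sync: float    # assumed [0, 1]; higher = better audio/video alignment
    deepfake: float    # assumed [0, 1]; higher = more likely synthetic


def cosine_similarity(a, b):
    """One common way to compare two embeddings (an assumption; the claim
    does not say how the similarity score is computed)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # undefined for zero vectors; treat as no similarity
    return dot / (norm_a * norm_b)


def final_output_score(scores, weights=(4.0, 4.0, 4.0), bias=0.0):
    """Hypothetical fusion: weighted sum of centered scores passed through a
    logistic function, giving a likelihood-like value in (0, 1).

    The deepfake score enters with a negative sign because a higher value is
    assumed to mean the sample is more likely synthetic.
    """
    w_sim, w_lip, w_fake = weights
    z = (
        w_sim * (scores.similarity - 0.5)
        + w_lip * (scores.lip_sync - 0.5)
        - w_fake * (scores.deepfake - 0.5)
        + bias
    )
    return 1.0 / (1.0 + math.exp(-z))


if __name__ == "__main__":
    genuine_like = Scores(similarity=0.9, lip_sync=0.85, deepfake=0.1)
    spoof_like = Scores(similarity=0.9, lip_sync=0.3, deepfake=0.95)
    print(final_output_score(genuine_like))
    print(final_output_score(spoof_like))
```

In this sketch a sample with a strong biometric match can still receive a low final score when the lip-sync score is low or the deepfake score is high, which is the reason for combining all three signals rather than relying on any one of them. In practice the weights and bias would be learned or tuned on labeled data rather than fixed by hand.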