US 12,333,639 B2
Synthetic audio-driven body animation using voice tempo
Evgeny Aleksandrovich Tumanov, Moscow (RU); Dmitry Aleksandrovich Korobchenko, Moscow (RU); Simon Yuen, Playa Vista, CA (US); and Kevin Margo, Los Gatos, CA (US)
Assigned to NVIDIA Corporation, Santa Clara, CA (US)
Appl. No. 18/007,867
Filed by NVIDIA Corporation, Santa Clara, CA (US)
PCT Filed Nov. 8, 2021, PCT No. PCT/RU2021/000485
§ 371(c)(1), (2) Date Dec. 2, 2022,
PCT Pub. No. WO2023/080806, PCT Pub. Date May 11, 2023.
Prior Publication US 2024/0233229 A1, Jul. 11, 2024
Int. Cl. G06T 13/00 (2011.01); G06T 13/20 (2011.01); G06T 13/40 (2011.01)
CPC G06T 13/205 (2013.01) [G06T 13/40 (2013.01)] 22 Claims
OG exemplary drawing
 
1. A processor comprising:
one or more circuits to:
generate an audio signal from input audio data;
compute, using a first loss function, one or more differences between the audio signal and audio signals of a plurality of data samples;
determine, based at least on the one or more differences, at least a first data sample and a second data sample from the plurality of data samples, the first data sample including a first audio signal corresponding to a first animation and the second data sample including a second audio signal corresponding to a second animation;
determine, using the first loss function and a second loss function that compares the first animation and the second animation, a transition point between the first audio signal and the second audio signal; and
based at least on the transition point, generate an animation based at least on combining at least a portion of the first animation and at least a portion of the second animation.