US 12,333,639 B2
	Synthetic audio-driven body animation using voice tempo
Evgeny Aleksandrovich Tumanov, Moscow (RU); Dmitry Aleksandrovich Korobchenko, Moscow (RU); Simon Yuen, Playa Vista, CA (US); and Kevin Margo, Los Gatos, CA (US)
Assigned to NVIDIA Corporation, Santa Clara, CA (US)
Appl. No. 18/007,867
Filed by NVIDIA Corporation, Santa Clara, CA (US)
PCT Filed Nov. 8, 2021, PCT No. PCT/RU2021/000485 § 371(c)(1), (2) Date Dec. 2, 2022, PCT Pub. No. WO2023/080806, PCT Pub. Date May 11, 2023.
Prior Publication US 2024/0233229 A1, Jul. 11, 2024
Int. Cl. G06T 13/00 (2011.01); G06T 13/20 (2011.01); G06T 13/40 (2011.01)

CPC G06T 13/205 (2013.01) [G06T 13/40 (2013.01)]

22 Claims

1. A processor comprising:

one or more circuits to:

generate an audio signal from input audio data;

compute, using a first loss function, one or more differences between the audio signal and audio signals of a plurality of data samples;

determine, based at least on the one or more differences, at least a first data sample and a second data sample from the plurality of data samples, the first data sample including a first audio signal corresponding to a first animation and the second data sample including a second audio signal corresponding to a second animation;

determine, using the first loss function and a second loss function that compares the first animation and the second animation, a transition point between the first audio signal and the second audio signal; and

based at least on the transition point, generate an animation based at least on combining at least a portion of the first animation and at least a portion of the second animation.