| CPC G06T 13/40 (2013.01) | 16 Claims |

|
1. An apparatus for generating a dancing avatar comprising: one or more processors; and a memory that stores one or more programs executed by the one or more processors,
wherein the processors
receive a latent code and map it using a neural network operation to obtain a plurality of genre-specific style codes for each of a plurality of dance genres, and
decode seed motion data and music data, which are motion data that must be referred to when generating an avatar's dance motion, using a genre-specific style code for a dance genre selected among the plurality of genre-specific style codes as a guide, thereby obtaining a dance vector representing a dance motion feature of the avatar in the selected dance genre,
wherein the processors
receive the dance vector, convert it into a format of the motion data to obtain dance data, and
apply an avatar skin to the obtained dance data, thereby generating a dancing avatar,
wherein during training, the processors
receive the dance data and the music data and project them into a virtual common feature vector space to obtain a motion vector and a music vector,
obtain a feature map by transformer encoding the motion vector and the music vector using a neural network operation,
determine the dance genre of the dance data from a genre score obtained by pooling the feature map, and
calculate a loss according to a difference between the determined dance genre and the selected dance genre, and back-propagate it.
|