| CPC G06T 13/205 (2013.01) [G06T 13/40 (2013.01); G06T 19/00 (2013.01); G10L 25/57 (2013.01); G10L 25/63 (2013.01)] | 12 Claims |

|
1. An information processing device comprising:
circuitry including a CPU that is configured to
recognize an emotion based on a speech waveform;
output a facial expression corresponding to the emotion;
compose an avatar showing the facial expression; and
compose a background corresponding to a scene estimated based on the speech waveform or uttered contents, wherein
the circuitry is further configured to
extract a waveform component indicating an environmental sound from the speech waveform; and
determine the background based on the waveform component.
|