US 12,423,897 B2
Information processing device, information processing method, and program
Kanna Tominaga, Tokyo (JP); Shuhei Miyazaki, Tokyo (JP); Hiromi Fukaya, Tokyo (JP); Takeshi Matsui, Tokyo (JP); Masayuki Sagano, Tokyo (JP); and Saki Nishihara, Tokyo (JP)
Assigned to SONY GROUP CORPORATION, Tokyo (JP)
Appl. No. 18/699,310
Filed by Sony Group Corporation, Tokyo (JP)
PCT Filed Oct. 6, 2022, PCT No. PCT/JP2022/037498
§ 371(c)(1), (2) Date Apr. 8, 2024,
PCT Pub. No. WO2023/068067, PCT Pub. Date Apr. 27, 2023.
Claims priority of application No. 2021-170366 (JP), filed on Oct. 18, 2021.
Prior Publication US 2024/0404158 A1, Dec. 5, 2024
Int. Cl. G06T 13/40 (2011.01); G06T 13/20 (2011.01); G06T 19/00 (2011.01); G10L 25/57 (2013.01); G10L 25/63 (2013.01)
CPC G06T 13/205 (2013.01) [G06T 13/40 (2013.01); G06T 19/00 (2013.01); G10L 25/57 (2013.01); G10L 25/63 (2013.01)] 12 Claims
OG exemplary drawing
 
1. An information processing device comprising:
circuitry including a CPU that is configured to
recognize an emotion based on a speech waveform;
output a facial expression corresponding to the emotion;
compose an avatar showing the facial expression; and
compose a background corresponding to a scene estimated based on the speech waveform or uttered contents, wherein
the circuitry is further configured to
extract a waveform component indicating an environmental sound from the speech waveform; and
determine the background based on the waveform component.