US 12,217,366 B2
Media distribution device and media distribution method
Toshiya Hamada, Tokyo (JP); and Takumi Tsuru, Tokyo (JP)
Assigned to SONY GROUP CORPORATION, Tokyo (JP)
Appl. No. 17/998,500
Filed by SONY GROUP CORPORATION, Tokyo (JP)
PCT Filed May 11, 2021, PCT No. PCT/JP2021/017801
§ 371(c)(1), (2) Date Nov. 11, 2022,
PCT Pub. No. WO2021/241190, PCT Pub. Date Dec. 2, 2021.
Claims priority of application No. 2020-090539 (JP), filed on May 25, 2020.
Prior Publication US 2023/0215102 A1, Jul. 6, 2023
Int. Cl. G06T 19/00 (2011.01); G06T 7/20 (2017.01); G10L 13/04 (2013.01); G10L 19/16 (2013.01)
CPC G06T 19/003 (2013.01) [G06T 7/20 (2013.01); G10L 13/04 (2013.01); G10L 19/167 (2013.01); G06T 2210/61 (2013.01)] 5 Claims
OG exemplary drawing
 
1. A media distribution device, comprising:
a rendering processing unit configured to generate a rendered image;
a guide voice generation unit configured to generate a guide voice that describes the rendered image viewed from a viewpoint in a virtual space, wherein
the guide voice is generated based on a scene description and a user viewpoint information,
the scene description describes a scene in the virtual space,
the user viewpoint information indicates a position and a direction of the viewpoint of a user, and
the guide voice generation unit includes:
an encoding parameter generation unit configured to generate an encoding parameter based on information supplied from the rendering processing unit;
a scene description text generation unit configured to generate a description text based on the rendered image and the encoding parameter, wherein the description text includes the scene description that describes the scene in the virtual space; and
a voice conversion unit configured to convert the description text into a voice; and
an audio encoding unit configured to:
mix the guide voice with original audio; and
encode the mixed guide voice.