US 11,722,832 B2
Signal processing apparatus and method, and program
Minoru Tsuji, Chiba (JP); Toru Chinen, Kanagawa (JP); and Mitsuyuki Hatanaka, Kanagawa (JP)
Assigned to Sony Corporation, Tokyo (JP)
Appl. No. 16/762,304
Filed by Sony Corporation, Tokyo (JP)
PCT Filed Oct. 31, 2018, PCT No. PCT/JP2018/040425
§ 371(c)(1), (2) Date May 7, 2020,
PCT Pub. No. WO2019/098022, PCT Pub. Date May 23, 2019.
Claims priority of application No. 2017-219450 (JP), filed on Nov. 14, 2017.
Prior Publication US 2021/0176581 A1, Jun. 10, 2021
Int. Cl. H04S 7/00 (2006.01); H04S 3/00 (2006.01)
CPC H04S 7/302 (2013.01) [H04S 3/008 (2013.01); H04S 7/308 (2013.01); H04S 2400/01 (2013.01); H04S 2400/11 (2013.01); H04S 2420/01 (2013.01)] 11 Claims
OG exemplary drawing
 
8. A signal processing method, by a signal processing apparatus, comprising:
displaying a listening space image on a display screen, the listening space image including sound images of audio objects and localization position marks corresponding to the sound images, wherein the sound images correspond to audio tracks, and wherein the listening space image indicates positions of the sound images in a listening space;
receiving a selection of a sound image of the displayed sound images based on selection by a user of an audio track of the audio tracks;
moving a position of the localization position mark corresponding to the selected sound image on the display screen in response to a user operation;
determining a localization position of the selected sound image relative to a listening position in the listening space based on a position of the moved localization position mark on the image, wherein the localization position of the selected sound image is determined using coordinates of a coordinate system having the listening position in the listening space as an origin;
calculating gain values of audio channels based on the determined localization position of the selected sound image and positions of speakers relative to the listening position; and
generating a bit stream on a basis of information associated with the determined localization position, the bit stream including the calculated gain values, wherein the bit stream is generated by treating the information associated with the localization position as meta information of the selected sound image, wherein the meta information includes position coordinates of the selected sound image, and wherein the listening space image displayed on the display screen includes a point of view image of the listening space as viewed from the listening position and an overhead image of the listening space as viewed from above.