US 12,294,850 B2
Audio processing apparatus and method, and program
Yuki Yamamoto, Tokyo (JP); Toru Chinen, Kanagawa (JP); and Minoru Tsuji, Chiba (JP)
Assigned to Sony Group Corporation, Tokyo (JP)
Filed by Sony Group Corporation, Tokyo (JP)
Filed on May 14, 2024, as Appl. No. 18/663,637.
Application 18/663,637 is a continuation of application No. 17/993,001, filed on Nov. 23, 2022, granted, now 12,096,202.
Application 17/993,001 is a continuation of application No. 17/474,669, filed on Sep. 14, 2021, granted, now 11,540,080.
Application 17/474,669 is a continuation of application No. 16/734,211, filed on Jan. 3, 2020, granted, now 11,140,505.
Application 16/734,211 is a continuation of application No. 15/737,026, granted, now 10,567,903, previously published as PCT/JP2016/067195, filed on Jun. 9, 2016.
Claims priority of application No. 2015-126650 (JP), filed on Jun. 24, 2015; and application No. 2015-148683 (JP), filed on Jul. 28, 2015.
Prior Publication US 2024/0298137 A1, Sep. 5, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. H04S 3/00 (2006.01); G10L 19/008 (2013.01); H04S 7/00 (2006.01); H04S 5/02 (2006.01)
CPC H04S 7/303 (2013.01) [G10L 19/008 (2013.01); H04S 3/008 (2013.01); H04S 5/02 (2013.01); H04S 2400/01 (2013.01); H04S 2400/11 (2013.01); H04S 2400/13 (2013.01); H04S 2400/15 (2013.01)]
3 Claims
1. An audio processing apparatus comprising:
an acquisition unit configured to acquire metadata including position information indicative of a position of an audio object and sound image information configured from a vector of two or more dimensions and representative of an extent of a sound image from the position;
a vector calculation unit configured to calculate, based on a horizontal direction angle and a vertical direction angle of a region representative of the extent of the sound image determined by the sound image information, a spread vector indicative of a position in the region; and
a gain calculation unit configured to calculate, based on the spread vector and using vector base amplitude panning (VBAP), a gain of each of audio signals supplied to two or more sound outputting units positioned in the proximity of the position indicated by the position information, wherein
the gain calculation unit calculates the gain for each spread vector in regard to each of the sound outputting units, calculates an addition value of the gains calculated in regard to the spread vectors for each of the sound outputting units, normalizes the addition value, and calculates a final gain for each of the sound outputting units based on the normalized addition value.
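The processing recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation, and it rests on several assumptions that the claim does not state: the spread vectors are sampled on a uniform grid over the region, only one triplet of sound outputting units is used, negative VBAP gains are clamped to zero, and normalization makes the squared gains sum to one. All function names and parameters (unit_vector, spread_vectors, vbap_gains, final_gains, n) are hypothetical.

```python
import math


def unit_vector(azimuth_deg, elevation_deg):
    """Convert azimuth/elevation in degrees to a 3-D unit vector."""
    az = math.radians(azimuth_deg)
    el = math.radians(elevation_deg)
    return (math.cos(el) * math.cos(az),
            math.cos(el) * math.sin(az),
            math.sin(el))


def det3(a, b, c):
    """Determinant of the 3x3 matrix whose rows are a, b and c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))


def spread_vectors(azimuth_deg, elevation_deg, width_deg, height_deg, n=3):
    """Sample an n-by-n grid of directions inside the region.

    Assumption: the region is centred on the object position and spans
    width_deg horizontally and height_deg vertically; n must be >= 2.
    """
    vectors = []
    for i in range(n):
        for j in range(n):
            d_az = (i / (n - 1) - 0.5) * width_deg
            d_el = (j / (n - 1) - 0.5) * height_deg
            vectors.append(unit_vector(azimuth_deg + d_az,
                                       elevation_deg + d_el))
    return vectors


def vbap_gains(p, speakers):
    """Solve p = g1*l1 + g2*l2 + g3*l3 for one triplet by Cramer's rule.

    Assumption: negative gains (direction outside the triplet) are
    clamped to zero instead of selecting another triplet.
    """
    l1, l2, l3 = speakers
    d = det3(l1, l2, l3)
    gains = (det3(p, l2, l3) / d,
             det3(l1, p, l3) / d,
             det3(l1, l2, p) / d)
    return tuple(max(g, 0.0) for g in gains)


def final_gains(azimuth_deg, elevation_deg, width_deg, height_deg, speakers):
    """Add per-spread-vector gains per speaker, then normalize the sums."""
    sums = [0.0] * len(speakers)
    for v in spread_vectors(azimuth_deg, elevation_deg,
                            width_deg, height_deg):
        for k, g in enumerate(vbap_gains(v, speakers)):
            sums[k] += g
    # Assumed normalization: squared gains sum to one.
    norm = math.sqrt(sum(s * s for s in sums))
    return [s / norm for s in sums] if norm > 0 else sums


if __name__ == "__main__":
    triplet = [unit_vector(30, 0), unit_vector(-30, 0), unit_vector(0, 45)]
    print(final_gains(0, 15, 20, 10, triplet))
```

In this sketch a wider or taller region spreads the sampled directions further apart, so the added gains are distributed more evenly across the sound outputting units before normalization.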