US 12,380,905 B2
Signal processing apparatus and method, training apparatus and method
Naoya Takahashi, Tokyo (JP)
Assigned to SONY GROUP CORPORATION, Tokyo (JP)
Filed by SONY GROUP CORPORATION, Tokyo (JP)
Filed on Jan. 10, 2024, as Appl. No. 18/408,991.
Application 18/408,991 is a continuation of application No. 16/769,122, granted, now 11,894,008, previously published as PCT/JP2018/043694, filed on Nov. 28, 2018.
Claims priority of application No. 2017-237401 (JP), filed on Dec. 12, 2017.
Prior Publication US 2024/0144945 A1, May 2, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 21/00 (2013.01); G10L 21/007 (2013.01); G10L 21/013 (2013.01); G10L 21/028 (2013.01); G10L 25/00 (2013.01)
CPC G10L 21/007 (2013.01) [G10L 21/013 (2013.01); G10L 21/028 (2013.01)]
18 Claims
 
1. A signal processing apparatus, comprising:
a central processing unit (CPU) configured to:
receive first acoustic data of a sound of an input sound source;
receive a voice quality converter parameter, wherein
the voice quality converter parameter is trained based on a discriminator parameter,
the discriminator parameter is trained based on first training data of the sound of the input sound source, second training data of a sound of a target sound source, and third training data of a sound of a sound source different from the input sound source and the target sound source,
the target sound source is different from the input sound source,
the first training data is based on second acoustic data of a mixed sound, and
the second acoustic data is different from parallel data and clean data; and
convert the first acoustic data of the input sound source to third acoustic data of voice quality of the target sound source, wherein the conversion of the first acoustic data to the third acoustic data is based on the voice quality converter parameter.
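
As an illustration only, the relationship recited in claim 1 between the discriminator parameter, the voice quality converter parameter, and the conversion step can be sketched in Python. Everything concrete in this sketch is an assumption that does not come from the claim: the linear three-class discriminator, the per-dimension affine converter, the cross-entropy objective, the synthetic 2-D feature clouds that stand in for the first, second, and third training data, and all function names. The step that would derive the first training data from acoustic data of a mixed sound is omitted. The sketch is not the patented implementation.

```python
import math
import random

INPUT, TARGET, OTHER = 0, 1, 2  # hypothetical discriminator class labels


def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def discriminate(d_param, frame):
    """Class probabilities (input, target, other) from a linear discriminator."""
    weights, biases = d_param
    return softmax([sum(w * x for w, x in zip(row, frame)) + b
                    for row, b in zip(weights, biases)])


def convert(c_param, frame):
    """Assumed converter form: per-dimension affine map y = scale * x + shift."""
    scale, shift = c_param
    return [s * x + t for s, x, t in zip(scale, frame, shift)]


def train_discriminator(labelled, dim, epochs=50, lr=0.05):
    """Fit the discriminator parameter by SGD on cross-entropy."""
    weights = [[0.0] * dim for _ in range(3)]
    biases = [0.0] * 3
    for _ in range(epochs):
        for frame, label in labelled:
            probs = discriminate((weights, biases), frame)
            for k in range(3):
                err = probs[k] - (1.0 if k == label else 0.0)
                biases[k] -= lr * err
                for j in range(dim):
                    weights[k][j] -= lr * err * frame[j]
    return weights, biases


def train_converter(d_param, input_frames, dim, epochs=100, lr=0.01):
    """Fit the converter parameter so the fixed discriminator says 'target'."""
    weights, _ = d_param
    scale, shift = [1.0] * dim, [0.0] * dim  # start from the identity mapping
    for _ in range(epochs):
        for frame in input_frames:
            probs = discriminate(d_param, convert((scale, shift), frame))
            errs = [probs[k] - (1.0 if k == TARGET else 0.0) for k in range(3)]
            for j in range(dim):
                # Gradient of -log p(target) with respect to converted feature j.
                grad = sum(errs[k] * weights[k][j] for k in range(3))
                scale[j] -= lr * grad * frame[j]
                shift[j] -= lr * grad
    return scale, shift


def demo(seed=0):
    rng = random.Random(seed)

    def cloud(cx, cy, n=30):
        # Synthetic 2-D "acoustic features"; stand-ins for real training data.
        return [[rng.gauss(cx, 0.5), rng.gauss(cy, 0.5)] for _ in range(n)]

    first, second, third = cloud(-2, 0), cloud(2, 0), cloud(0, 3)
    labelled = ([(f, INPUT) for f in first] + [(f, TARGET) for f in second]
                + [(f, OTHER) for f in third])
    rng.shuffle(labelled)
    d_param = train_discriminator(labelled, dim=2)
    c_param = train_converter(d_param, first, dim=2)

    def mean_target_prob(frames):
        return sum(discriminate(d_param, f)[TARGET] for f in frames) / len(frames)

    before = mean_target_prob(first)
    after = mean_target_prob([convert(c_param, f) for f in first])
    return before, after
```

Calling `demo()` returns the mean probability that the discriminator assigns to the target class for the input-source frames before and after conversion. In this toy setting the second value should be larger, because the converter parameter is fitted against the already trained discriminator parameter, mirroring the dependency stated in the claim.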