US 11,996,086 B2
Estimation device, estimation method, and estimation program
Naohiro Tawara, Tokyo (JP); Hosana Kamiyama, Tokyo (JP); Satoshi Kobashikawa, Tokyo (JP); and Atsunori Ogawa, Tokyo (JP)
Assigned to NIPPON TELEGRAPH AND TELEPHONE CORPORATION, Tokyo (JP)
Appl. No. 17/636,826
Filed by NIPPON TELEGRAPH AND TELEPHONE CORPORATION, Tokyo (JP)
PCT Filed Aug. 19, 2019, PCT No. PCT/JP2019/032271
§ 371(c)(1), (2) Date Feb. 18, 2022,
PCT Pub. No. WO2021/033233, PCT Pub. Date Feb. 25, 2021.
Prior Publication US 2022/0335928 A1, Oct. 20, 2022
Int. Cl. G10L 15/08 (2006.01); G10L 17/02 (2013.01); G10L 17/04 (2013.01); G10L 17/08 (2013.01); G10L 17/18 (2013.01); G10L 17/26 (2013.01)
CPC G10L 15/08 (2013.01) [G10L 17/02 (2013.01); G10L 17/04 (2013.01); G10L 17/08 (2013.01); G10L 17/18 (2013.01); G10L 17/26 (2013.01)] 11 Claims
OG exemplary drawing
 
1. An estimation apparatus, configured to:
cluster a group of voice signals including a voice signal having a speaker attribute to be estimated into a plurality of clusters;
identify, from the plurality of clusters, a cluster to which the voice signal to be estimated belongs;
estimate speaker attributes of voice signals in the identified cluster, by using a speaker attribute estimation model trained for estimating a speaker attribute of a voice signal based on a feature of the voice signal; and
estimate an attribute of the entire identified cluster, by using an estimation result of the speaker attributes of the voice signals in the identified cluster, and output an estimation result of the speaker attribute of the entire identified cluster, as an estimation result of the speaker attribute of the voice signal to be estimated.