CPC G10L 15/08 (2013.01) [G10L 17/02 (2013.01); G10L 17/04 (2013.01); G10L 17/08 (2013.01); G10L 17/18 (2013.01); G10L 17/26 (2013.01)] | 11 Claims |
1. An estimation apparatus, configured to:
cluster a group of voice signals including a voice signal having a speaker attribute to be estimated into a plurality of clusters;
identify, from the plurality of clusters, a cluster to which the voice signal to be estimated belongs;
estimate speaker attributes of voice signals in the identified cluster, by using a speaker attribute estimation model trained for estimating a speaker attribute of a voice signal based on a feature of the voice signal; and
estimate an attribute of the entire identified cluster, by using an estimation result of the speaker attributes of the voice signals in the identified cluster, and output an estimation result of the speaker attribute of the entire identified cluster, as an estimation result of the speaker attribute of the voice signal to be estimated.
|