| CPC G10L 21/007 (2013.01) [G06N 20/00 (2019.01); G10L 25/30 (2013.01)] | 6 Claims |

|
1. A voice signal conversion model learning device comprising:
a processor; and
a storage medium having computer program instructions stored thereon, wherein the computer program instructions, when executed by the processor, perform processing of:
acquiring learning input data which is an input voice signal; and
executing a conversion learning model which is a model of machine learning including learning stage conversion processing of converting the learning input data into learning stage conversion destination data which is a voice signal of a conversion destination, wherein
the learning stage conversion processing includes local feature quantity acquisition processing of acquiring a feature quantity for each learning input-side subset which is a subset of processing target input data having the processing target input data as a population, based on the processing target input data which is data to be processed,
the conversion learning model further includes adjustment parameter value acquisition processing of acquiring an adjustment parameter value, which is a value of a parameter for adjusting a statistical value of a distribution of the feature quantity, based on the learning input data, and
the learning stage conversion processing converts the learning input data into the learning stage conversion destination data using a result of a predetermined calculation based on the adjustment parameter value, where the predetermined calculation converts the feature quantity according to the adjustment parameter value using affine conversion.
|
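
For illustration only, and not as the claimed implementation, the local feature quantity acquisition, adjustment parameter value acquisition, and affine conversion recited in claim 1 can be sketched in Python with NumPy. Everything named here is an assumption made for the sketch: the function names, the sliding-window choice of input-side subsets, the random projection weights standing in for learned parameters, and the use of per-channel mean and standard deviation as the statistical values being adjusted. The claim itself does not specify any of these.

```python
import numpy as np


def local_feature_quantities(signal, weights, hop):
    """Acquire one feature vector per input-side subset.

    Assumption: each subset is a sliding window over the signal, and the
    feature is a linear projection of that window (a stand-in for a
    learned convolution).
    """
    win = weights.shape[1]
    starts = range(0, len(signal) - win + 1, hop)
    frames = np.stack([signal[s:s + win] for s in starts])  # (n_frames, win)
    return frames @ weights.T  # (n_frames, n_channels)


def adjustment_parameters(signal, proj):
    """Derive per-channel scale and shift values from the input signal.

    Assumption: the parameters are a linear map of two global statistics
    of the input; a trained model would learn this mapping.
    """
    stats = np.array([signal.mean(), signal.std()])
    scale, shift = np.split(proj @ stats, 2)
    return scale, shift


def affine_convert(features, scale, shift, eps=1e-8):
    """Normalize each channel, then apply the affine map scale * x + shift.

    After this step the per-channel mean is approximately `shift` and the
    per-channel standard deviation is approximately |scale|.
    """
    mean = features.mean(axis=0)
    std = features.std(axis=0)
    normalized = (features - mean) / (std + eps)
    return scale * normalized + shift


# Hypothetical usage with random data in place of a real voice signal.
rng = np.random.default_rng(0)
signal = rng.standard_normal(64)
weights = rng.standard_normal((3, 8))  # 3 channels, window of 8 samples
proj = rng.standard_normal((6, 2))     # maps 2 statistics to 3 scales + 3 shifts

features = local_feature_quantities(signal, weights, hop=4)
scale, shift = adjustment_parameters(signal, proj)
converted = affine_convert(features, scale, shift)
```

In this sketch the adjustment parameter values depend only on the input signal, so the statistics of the feature distribution are set by the input rather than by fixed constants, which is one reading of the adjustment step in the claim.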