CPC G10L 21/0232 (2013.01) [G10L 15/22 (2013.01); G10L 15/30 (2013.01); H04R 1/406 (2013.01); H04R 3/005 (2013.01); G10L 2021/02166 (2013.01)] | 13 Claims |
1. An information processing device, comprising:
an external apparatus output sound characteristic database that includes sound source directions of a plurality of external apparatus output sounds, wherein each sound source direction of the sound source directions corresponds to a respective external apparatus output sound of the plurality of external apparatus output sounds; and
a central processing unit (CPU) configured to:
analyze a plurality of characteristics of an external apparatus output sound of an external apparatus, wherein
the plurality of characteristics of the external apparatus output sound includes a first sound source direction of the external apparatus output sound and a first frequency characteristic of the external apparatus output sound,
the plurality of external apparatus output sounds includes the external apparatus output sound, and
the sound source directions include the first sound source direction;
record the analyzed plurality of characteristics of the external apparatus output sound in the external apparatus output sound characteristic database;
cause output of audio data having a second frequency characteristic from the external apparatus;
receive an input sound that is acquired by a microphone array;
execute analysis of the input sound; and
analyze, based on the executed analysis of the input sound, a second sound source direction of the input sound and a third frequency characteristic of the input sound, wherein the sound source directions include the second sound source direction;
extract a user spoken voice from the input sound acquired by the microphone array;
determine that the input sound includes the external apparatus output sound based on the sound source directions of the plurality of external apparatus output sounds; and
remove the external apparatus output sound from the input sound based on
a feature amount of the external apparatus output sound recorded in the external apparatus output sound characteristic database, and
the determination that the input sound includes the external apparatus output sound.
|