CPC G10L 21/0232 (2013.01) [G10L 15/063 (2013.01); H04R 1/406 (2013.01); H04R 3/005 (2013.01); G10L 2015/0631 (2013.01); G10L 2021/02166 (2013.01)]
9 Claims
1. A speech processing device connectable to a sound collecting device disposed in a closed space, the speech processing device comprising:
a processor; and
a memory having instructions that, when executed by the processor, cause the processor to perform operations, the operations comprising:
acquiring speaking person position information, the speaking person position information indicating a positional relationship between the sound collecting device and each of a plurality of persons present in the closed space, the plurality of persons including a main speaking person;
estimating a mixing rate indicating a ratio of a speech signal of the main speaking person to a speech signal of another person other than the main speaking person based on the speaking person position information; and
determining whether suppression of a crosstalk component due to speaking of the another person contained in the speech signal of the main speaking person is necessary based on an estimation result of the mixing rate.
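The three operations recited in claim 1 (acquiring position information, estimating a mixing rate, and deciding whether crosstalk suppression is necessary) can be illustrated with a minimal sketch. The claim does not state how the mixing rate is computed from the positions, so everything below is an assumption made only for illustration: the `Person` record, the representation of position as a distance in metres, the inverse-square attenuation model, and the threshold value of 10.0 are all hypothetical and are not taken from the patent.

```python
import math
from dataclasses import dataclass


@dataclass
class Person:
    """Hypothetical record of one person's position relative to the sound collecting device."""
    name: str
    distance_m: float      # assumed representation: distance to the device in metres
    is_main: bool = False  # True for the main speaking person


def estimate_mixing_rate(persons):
    """Estimate the ratio of the main speaker's signal to the other persons' signals.

    Assumption (not from the claim): received power falls off as 1 / distance^2,
    and every person speaks at the same source level.
    """
    if any(p.distance_m <= 0 for p in persons):
        raise ValueError("distances must be positive")
    main = [p for p in persons if p.is_main]
    if len(main) != 1:
        raise ValueError("exactly one main speaking person is required")
    others = [p for p in persons if not p.is_main]

    main_power = 1.0 / main[0].distance_m ** 2
    other_power = sum(1.0 / p.distance_m ** 2 for p in others)
    if other_power == 0.0:
        return math.inf  # nobody else present, so no crosstalk is possible
    return main_power / other_power


def suppression_needed(mixing_rate, threshold=10.0):
    """Decide whether crosstalk suppression is necessary.

    Assumption: suppression is applied when the estimated ratio is below
    a fixed, hypothetical threshold.
    """
    return mixing_rate < threshold


# Example: a main speaker close to the device with two other persons nearby.
cabin = [
    Person("driver", 0.5, is_main=True),
    Person("front passenger", 1.0),
    Person("rear passenger", 2.0),
]
rate = estimate_mixing_rate(cabin)
print(rate, suppression_needed(rate))
```

Under these assumptions, a main speaking person who is much closer to the device than everyone else yields a high mixing rate and suppression is skipped, while comparable distances yield a low rate and suppression is applied. A real device could use any other position-to-ratio mapping that satisfies the claim language.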