US 11,735,201 B2
Speech processing device and speech processing method
Masanari Miyamoto, Fukuoka (JP)
Assigned to PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD., Osaka (JP)
Filed by PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD., Osaka (JP)
Filed on Jun. 28, 2022, as Appl. No. 17/851,945.
Application 17/851,945 is a continuation of application No. 17/179,985, filed on Feb. 19, 2021, now Pat. No. 11,410,671.
Claims priority of application No. 2020-028730 (JP), filed on Feb. 21, 2020; application No. 2020-028731 (JP), filed on Feb. 21, 2020; and application No. 2020-033406 (JP), filed on Feb. 28, 2020.
Prior Publication US 2022/0328059 A1, Oct. 13, 2022
Int. Cl. G10L 21/0232 (2013.01); G10L 15/06 (2013.01); H04R 3/00 (2006.01); H04R 1/40 (2006.01); G10L 21/0216 (2013.01)

CPC G10L 21/0232 (2013.01) [G10L 15/063 (2013.01); H04R 1/406 (2013.01); H04R 3/005 (2013.01); G10L 2015/0631 (2013.01); G10L 2021/02166 (2013.01)]

9 Claims

1. A speech processing device connectable to a sound collecting device disposed in a closed space, the speech processing device comprising:

a processor; and

a memory having instructions that, when executed by the processor, cause the processor to perform operations, the operations comprising:

acquiring speaking person position information, the speaking person position information indicating a positional relationship between the sound collecting device and each of a plurality of persons present in the closed space, the plurality of persons including a main speaking person;

estimating a mixing rate indicating a ratio of a speech signal of the main speaking person to a speech signal of another person other than the main speaking person based on the speaking person position information; and

determining whether suppression of a crosstalk component due to speaking of the another person contained in the speech signal of the main speaking person is necessary based on an estimation result of the mixing rate.
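
The three operations recited in claim 1 can be illustrated with a minimal sketch. The claim does not state how the mixing rate is computed from the position information, so everything specific here is an assumption made for illustration: the `Person` type, the use of distance to the sound collecting device as the position information, the 1/distance amplitude model, and the threshold value are all hypothetical and are not taken from the patent.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class Person:
    """Hypothetical record of one person in the closed space."""
    name: str
    distance_m: float  # assumed position info: distance to the sound collecting device


def estimate_mixing_rate(main: Person, others: List[Person]) -> float:
    """Estimate the ratio of the main speaker's level to the strongest other level.

    Assumption (not from the claim): amplitude falls off as 1/distance.
    """
    if not others:
        return math.inf  # nobody else present, so no crosstalk source
    main_level = 1.0 / main.distance_m
    other_level = max(1.0 / p.distance_m for p in others)
    return main_level / other_level


def suppression_needed(mixing_rate: float, threshold: float = 4.0) -> bool:
    """Decide whether crosstalk suppression is necessary.

    The threshold is an illustrative value, not one given in the claim.
    """
    return mixing_rate < threshold


if __name__ == "__main__":
    driver = Person("driver", 0.5)
    passenger = Person("passenger", 1.0)
    rate = estimate_mixing_rate(driver, [passenger])
    print(rate, suppression_needed(rate))
```

Under these assumptions, a low mixing rate means the other person's speech reaches the sound collecting device at a level comparable to the main speaking person's, so suppression is judged necessary; a high rate means the crosstalk component is small enough to skip suppression.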