CPC G10L 15/22 (2013.01) [G10L 15/02 (2013.01); G10L 15/10 (2013.01)] | 20 Claims
1. An electronic device comprising:
a receiver receiving a speech signal;
memory storing one or more computer programs; and
one or more processors communicatively coupled to the receiver and the memory,
wherein the one or more computer programs include computer-executable instructions that, when executed by the one or more processors, cause the electronic device to:
control the receiver to receive the speech signal,
determine whether the speech signal comprises speech signals of a plurality of different speakers,
in response to determining that the speech signal comprises the speech signals of the plurality of different speakers, detect feature information from a speech signal of each speaker,
based on the feature information, determine relationships between speech content of the plurality of different speakers,
based on the determined relationships between the speech content of the plurality of different speakers, determine that speech content of a first speaker among the plurality of different speakers and speech content of a second speaker among the plurality of different speakers are generated in a same speech domain and that conflicts occur between the speech content of the first speaker and the speech content of the second speaker, and
based on the determining that conflicts occur and the determined relationships between the speech content of the plurality of different speakers, control the electronic device and at least one other electronic device to perform an operation corresponding to each speech content of the plurality of different speakers.
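The control flow recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation: the names `Features`, `DOMAIN_BY_VERB`, `detect_features`, `find_conflicts`, and `plan_operations` are hypothetical. The sketch assumes the speech signal has already been split into (speaker, text) segments, that the first word of each utterance identifies the speech domain, and that a "conflict" means two speakers request different values in the same domain.

```python
from dataclasses import dataclass
from itertools import combinations

# Assumption: the first word of an utterance identifies its speech domain.
DOMAIN_BY_VERB = {"play": "music", "heat": "temperature"}


@dataclass(frozen=True)
class Features:
    """Hypothetical feature information detected from one speaker's speech."""
    speaker: str
    domain: str
    value: str


def has_multiple_speakers(segments):
    """True if the (speaker, text) segments come from more than one speaker."""
    return len({speaker for speaker, _ in segments}) > 1


def detect_features(speaker, text):
    """Toy keyword parser standing in for real feature detection."""
    verb, _, rest = text.lower().partition(" ")
    return Features(speaker, DOMAIN_BY_VERB.get(verb, "unknown"), rest)


def find_conflicts(features):
    """Pairs of speakers whose content is in the same domain but differs."""
    return [(a, b) for a, b in combinations(features, 2)
            if a.domain == b.domain and a.value != b.value]


def plan_operations(segments, devices):
    """Map each speaker to a device; devices[0] is the receiving device."""
    features = [detect_features(s, t) for s, t in segments]
    # By default, every request is handled by the receiving device.
    plan = {f.speaker: devices[0] for f in features}
    if not has_multiple_speakers(segments):
        return plan
    next_device = 1
    for _, second in find_conflicts(features):
        # Move the second speaker of a conflicting pair to another device,
        # if one is still available.
        if plan[second.speaker] == devices[0] and next_device < len(devices):
            plan[second.speaker] = devices[next_device]
            next_device += 1
    return plan


devices = ["living_room_speaker", "kitchen_speaker"]
conflicting = plan_operations([("alice", "play jazz"), ("bob", "play rock")], devices)
compatible = plan_operations([("alice", "play jazz"), ("bob", "heat 22")], devices)
```

In this sketch the two conflicting music requests are split across the receiving device and one other device, while requests in different domains both stay on the receiving device. A real system would derive the feature information from the audio itself and would need a policy for the case where conflicts outnumber the available devices.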