CPC G10L 15/32 (2013.01) [G06F 16/245 (2019.01); G06N 3/04 (2013.01); G10L 15/02 (2013.01); G10L 15/04 (2013.01); G10L 15/142 (2013.01); G10L 15/16 (2013.01); G10L 15/22 (2013.01); G10L 2015/088 (2013.01)] | 9 Claims |
1. A speech recognition device a processor configured to execute operations comprising:
performing first speech recognition processing using a first method on speech data of a conversation made by a plurality of speakers and outputs a speech recognition result for each of respective uttered speech segments of the plurality of speakers;
determining, on the basis of a result of the first speech recognition processing, a subject segment of the conversation, wherein the subject segment represents a segment of the speech data including a part of the conversation with utterances about a subject; and
performing second speech recognition processing using a second method higher in accuracy than the first method on speech data in the segment determined to be the subject segment by the determiner and outputs a speech recognition result as a subject text.
|