US 11,688,416 B2
Method and system for speech emotion recognition
Yatish Jayant Naik Raikar, Bangalore (IN); Varunkumar Tripathi, Karnataka (IN); Kiran Chittella, Bangalore (IN); and Vinayak Kulkarni, Nargund (IN)
Assigned to DISH Network Technologies India Private Limited
Filed by SLING MEDIA PVT LTD, Bangaluru (IN)
Filed on Aug. 30, 2021, as Appl. No. 17/446,385.
Application 17/446,385 is a continuation of application No. 16/677,324, filed on Nov. 7, 2019, granted, now 11,133,025.
Prior Publication US 2021/0390973 A1, Dec. 16, 2021
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 25/63 (2013.01); G10L 15/02 (2006.01); G10L 15/22 (2006.01); G10L 15/26 (2006.01)
CPC G10L 25/63 (2013.01) [G10L 15/02 (2013.01); G10L 15/22 (2013.01); G10L 15/26 (2013.01); G10L 2015/027 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method of enriching speech to text communications between users in a speech chat session using speech emotion recognition, the method comprising:
applying a set of rules for speech emotion recognition to convert observed emotions in a speech sample and to enrich text with visual emotion content in the speech to text communications by:
generating a data set of speech samples with labels of a plurality of emotion classes;
selecting a set of acoustic features from each of the plurality of emotion classes;
applying the set of rules for the speech emotion recognition based on at least one of the selected set of acoustic features and the data set of speech samples;
computing a number of rules that have been satisfied from the set of rules for the speech emotion recognition that have applied to the selected set of acoustic features to label the selected set of acoustic features; and
presenting the enriched text in speech-to-text communications in accordance with labeled acoustic features between users in the speech chat session for visual notice of the observed emotions in the speech sample.