US 12,367,867 B2
System for generating voice in an ongoing call session based on artificial intelligent techniques
Sandeep Singh Spall, Moga (IN); Tarun Gupta, Noida (IN); and Narang Lucky Manoharlal, Noida (IN)
Assigned to SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR)
Filed by SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR)
Filed on Oct. 25, 2022, as Appl. No. 17/973,266.
Claims priority of application No. 202111048934 (IN), filed on Oct. 26, 2021.
Prior Publication US 2023/0130777 A1, Apr. 27, 2023
Int. Cl. G10L 15/16 (2006.01); G10L 15/02 (2006.01)
CPC G10L 15/16 (2013.01) [G10L 2015/025 (2013.01)] 14 Claims
OG exemplary drawing
 
1. A method of generating voice in a call session, the method comprising:
extracting a plurality of features from a voice input through an artificial neural network (ANN);
identifying one or more lost audio frames within the voice input, wherein the one or more lost audio frames are lost due to at least one of vocal issues or network issues;
predicting by the ANN, for each of the one or more lost audio frames, one or more features of the respective lost audio frame;
superposing the predicted features upon the voice input to generate an updated voice input; and
correcting the updated voice input by:
obtaining a confidence score of the updated voice input;
splitting the updated voice input into a plurality of phonemes based on the confidence score;
identifying one or more non-aligned phonemes out of the plurality of phonemes based on comparing the plurality of phonemes with language vocabulary knowledge;
generating a plurality of variant phonemes; and
updating the identified one or more non-aligned phonemes through one or more of:
replacing the identified one or more non-aligned phonemes with the plurality of variant phonemes;
adding additional phonemes to supplement the identified one or more non-aligned phonemes; or
deleting the identified one or more non-aligned phonemes.