US 12,469,483 B1
Comprehensive AI-enabled systems for immersive voice, companion, and augmented/virtual reality interaction solutions
Bao Tran, Saratoga, CA (US)
Filed by Bao Tran, Saratoga, CA (US)
Filed on Apr. 1, 2025, as Appl. No. 19/097,095.
Int. Cl. G10L 15/26 (2006.01); G10L 13/08 (2013.01); G10L 15/18 (2013.01)
CPC G10L 13/08 (2013.01) [G10L 15/1822 (2013.01); G10L 15/26 (2013.01)] 21 Claims
OG exemplary drawing
 
1. A computer-implemented method for operating an artificial intelligence voice agent system, comprising:
receiving voice input through communication channels;
analyzing converted text through natural language processing (NLP) pipelines implementing intent recognition and sentiment analysis detecting emotional cues using a multimodal large language model (LLM);
generating response content using machine learning models trained on domain-specific corpora;
converting generated responses to synthetic speech through text-to-speech (TTS) engines;
integrating a customer relationship management (CRM) platform with an enterprise resource planning (ERP) database;
dynamic latency compensation maintaining sub-300 ms response times during voice interactions;
and implementing continuous learning by updating language understanding models using conversation logs, voice recognition parameters based on user feedback, and response generation patterns.