| CPC G06T 13/40 (2013.01) [G06N 3/006 (2013.01); G06V 40/161 (2022.01); G06V 40/172 (2022.01); G10L 17/22 (2013.01); H04R 1/406 (2013.01); H04R 3/005 (2013.01); H04R 2201/401 (2013.01)] | 20 Claims |

|
1. An artificial intelligence (AI) avatar-based interaction service method performed in a system including an unmanned information terminal and an interaction service device, the method comprising:
transmitting a sound signal collected from a microphone array mounted in the unmanned information terminal and an image signal collected from a vision sensor to the interaction service device;
setting a sensing area based on a received sound signal and image signal by the interaction service device;
recognizing an active speaker based on a voice signal of a user and an image signal of the user collected in the sensing area, by the interaction service device;
determining a voice information based on the voice signal in response to recognizing the active speaker;
determining a non-verbal information based on the image signal in response to recognizing the active speaker;
generating a response for the recognized active speaker, 3D rendering an artificial intelligence avatar, said artificial intelligence avatar reflecting a desired response, wherein generating the response includes applying a first weight to the voice information and a second weight to the non-verbal information in response to the voice information and the non-verbal information having consistent results, and applying a third weight different than the first weight to the voice information and a fourth weight different than the second weight to the non-verbal information in response to the voice information and the non-verbal information having inconsistent results; and
using the interaction service device to provide the rendered artificial intelligence avatar to the unmanned information terminal.
|