US 12,277,637 B2
	AI avatar-based interaction service method and apparatus
Han Seok Ko, Seoul (KR); Jeong Min Bae, Seoul (KR); and Miguel Alba, Seoul (KR)
Assigned to DATUM POINT LABS, INC., Jackson, WY (US)
Filed by DMLab. CO., LTD., Seoul (KR)
Filed on Feb. 11, 2022, as Appl. No. 17/669,666.
Claims priority of application No. 10-2021-0034756 (KR), filed on Mar. 17, 2021; and application No. 10-2022-0002347 (KR), filed on Jan. 6, 2022.
Prior Publication US 2022/0301251 A1, Sep. 22, 2022
Int. Cl. G06T 13/40 (2011.01); G06N 3/006 (2023.01); G06V 40/16 (2022.01); G10L 17/22 (2013.01); H04R 1/40 (2006.01); H04R 3/00 (2006.01)

CPC G06T 13/40 (2013.01) [G06N 3/006 (2013.01); G06V 40/161 (2022.01); G06V 40/172 (2022.01); G10L 17/22 (2013.01); H04R 1/406 (2013.01); H04R 3/005 (2013.01); H04R 2201/401 (2013.01)]

20 Claims

1. An artificial intelligence (AI) avatar-based interaction service method performed in a system including an unmanned information terminal and an interaction service device, the method comprising:

transmitting a sound signal collected from a microphone array mounted in the unmanned information terminal and an image signal collected from a vision sensor to the interaction service device;

setting a sensing area based on a received sound signal and image signal by the interaction service device;

recognizing an active speaker based on a voice signal of a user and an image signal of the user collected in the sensing area, by the interaction service device;

determining a voice information based on the voice signal in response to recognizing the active speaker;

determining a non-verbal information based on the image signal in response to recognizing the active speaker;

generating a response for the recognized active speaker, 3D rendering an artificial intelligence avatar, said artificial intelligence avatar reflecting a desired response, wherein generating the response includes applying a first weight to the voice information and a second weight to the non-verbal information in response to the voice information and the non-verbal information having consistent results, and applying a third weight different than the first weight to the voice information and a fourth weight different than the second weight to the non-verbal information in response to the voice information and the non-verbal information having inconsistent results; and

using the interaction service device to provide the rendered artificial intelligence avatar to the unmanned information terminal.