US 11,853,901 B2
Learning method of AI model and electronic apparatus
Kwangwon Ko, Suwon-si (KR); and Jongchul Park, Suwon-si (KR)
Assigned to SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR)
Filed by SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR)
Filed on Jul. 24, 2020, as Appl. No. 16/938,593.
Claims priority of application No. 10-2019-0090963 (KR), filed on Jul. 26, 2019.
Prior Publication US 2021/0027172 A1, Jan. 28, 2021
Int. Cl. G06N 3/02 (2006.01); G06N 5/02 (2023.01)
CPC G06N 5/02 (2013.01) [G06N 3/02 (2013.01)] 4 Claims
OG exemplary drawing
 
1. A controlling method of an electronic apparatus for training an artificial intelligence (AI) model configured to provide information identifying a recommendation item and a recommended user, the method comprising:
obtaining first user data and first item data at a first time interval;
obtaining at least one first keyword for the first user data and at least one second keyword for the first item data;
generating a first semantic vector at a second time interval by inputting at least one keyword for the first user data to a deep structured semantic model (DSSM);
generating a second semantic vector at the second time interval by inputting at least one keyword for the first item data to the DSSM;
generating a first vector that represents a relevance between the first semantic vector and the second semantic vector at the second time interval;
storing data corresponding to the generated first vector, the first semantic vector, and the second semantic vector;
obtaining an updated weight for a first AI model by training the first AI model based on the stored data;
applying the updated weight to a second AI model;
providing the information identifying the recommendation item or the recommended user based on the second AI model; and
re-updating the updated weight of the first AI model based on a user interaction of user,
wherein providing the information identifying the recommendation item or the recommended user based on the second AI model comprises:
obtaining at least one among second user data and second item data in real time;
based on the second user data being obtained in real time, obtaining at least one third keyword for the second user data;
generating a third semantic vector by inputting the at least one third keyword for the second user data to the DSSM;
generating a fourth semantic vector by inputting the third semantic vector to the second AI model;
providing the information identifying the recommendation item based on the fourth semantic vector through a display of the electronic apparatus;
receiving the user interaction of user corresponding to the second user data associated with the recommendation item through an input interface of the electronic apparatus;
generating a second vector that represents a relevance between the third semantic vector and the fourth semantic vector;
based on the second item data being obtained in real time, obtaining at least one fourth keyword for the second item data;
generating a fifth semantic vector by inputting the at least one fourth keyword for the second item data to the DSSM;
generating a sixth semantic vector by inputting the fifth semantic vector to the second AI model;
providing the information identifying the recommended user based on the sixth semantic vector through the display of the electronic apparatus;
providing the recommended user with information of the recommendation item corresponding to the fifth semantic vector through the display of the electronic apparatus;
receiving the user interaction of the recommended user associated with the recommendation item through the input interface of the electronic apparatus; and
generating a third vector that represents a relevance between the fifth semantic vector and the sixth semantic vector,
wherein the re-updating the updated weight of the first AI model based on the user interaction comprises:
re-updating the updated weight of the first AI model based on the user interaction of user corresponding to the second user data, the third semantic vector, the fourth semantic vector, and the second vector; and
re-updating the updated weight of the first AI model based on the user interaction of the recommended user, the fifth semantic vector, the sixth semantic vector, and the third vector,
wherein the providing the information identifying the recommendation item comprises displaying an advertising content of the recommendation item through the display of the electronic apparatus,
wherein the providing the information identifying the recommended user comprises displaying a list of the recommended user through the display of the electronic apparatus,
wherein the providing the recommended user with information of the recommendation item comprises displaying an advertising content of the recommendation item corresponding to the fifth semantic vector through the display of the electronic apparatus,
wherein the re-updating the updated weight of the first AI model comprises training the first AI model by performing a reinforcement learning based on state data, action data and reward data, and obtaining the re-updated weight for the first AI model,
wherein the state data comprises the third semantic vector and the fifth semantic vector, the action data comprises the fourth semantic vector and the sixth semantic vector, and the reward data comprises the user interaction of user corresponding to the second user data and the user interaction of the recommended user,
wherein the first time interval is greater than the second time interval, and
wherein the first time interval is predetermined according to a category of the item.