CPC G10L 15/16 (2013.01) [G06N 3/08 (2013.01); G10L 15/183 (2013.01); G10L 15/22 (2013.01)] | 20 Claims |
1. A computer-implemented method, comprising:
receiving, by at least one server communicatively coupled with a user device, a dialogue utterance;
applying, by the at least one server, a generative adversarial network (GAN) to the dialogue utterance to generate response candidates to the utterance and determine a response from the response candidates to respond to the utterance, the GAN comprising a generator and a discriminator, and wherein the applying the GAN comprises:
generating, by the generator, utilizing source attributes and target attributes, the response candidates responsive to the dialogue utterance, wherein the source attributes comprise a speaker identity, a speaker background, a speaker location, a speaker preference, a speaker sentiment, or combination thereof, and the target attributes comprise a respondent identity, a respondent background, a respondent location, a respondent preference, a respondent sentiment, or a combination thereof;
determining, by the discriminator, the response to respond to the dialogue utterance from the response candidates based on discrimination metrics comprising human-likeness and persona, the discriminator comprising an attribute discriminator to utilize the target attributes as a discriminator target and a dialogue history for multi-label attribute classification to classify outputs from the generator with an attribute class, and an adversarial discriminator to determine a binary output for human-likeness to identify the response candidates as real or fake; and
causing, by the at least one server, communication of the response to the user device responsive to the dialogue utterance.
|