CPC G06F 40/30 (2020.01) [G06F 40/289 (2020.01); G06N 20/00 (2019.01); H04L 51/02 (2013.01); G06F 40/205 (2020.01)] | 20 Claims |
1. A computer-implemented method comprising:
accessing an utterance and a target domain of a chatbot;
generating a sentence embedding for the utterance;
predicting a first probability as to whether the utterance belongs to the target domain of the chatbot based on the sentence embedding for the utterance and a distance or density deviation between the sentence embedding and an embedding representation for a cluster of a plurality of clusters of in-domain utterances associated with the target domain of the chatbot;
predicting a second probability as to whether the utterance belongs to the target domain of the chatbot based on the sentence embedding for the utterance and a similarity or difference between the sentence embedding and an embedding representation for a cluster of the plurality of clusters of in-domain utterances associated with the target domain of the chatbot;
determining, based on the first probability and the second probability, a final probability as to whether the utterance belongs to the target domain of the chatbot; and
classifying the utterance as in-domain or out-of-domain for the chatbot based on the final probability.
|