CPC G06F 40/35 (2020.01) [G06N 20/00 (2019.01); H04L 51/02 (2013.01); G06F 40/205 (2020.01); G06F 40/253 (2020.01)] | 24 Claims |
1. A method comprising:
receiving, by a chatbot system, an utterance generated by a user interacting with the chatbot system;
inputting, by the chatbot system, the utterance into a machine-learning model comprising a set of binary classifiers, wherein each binary classifier of the set of binary classifiers: (i) is configured to estimate a probability that the utterance corresponds to a class of a set of classes; (ii) is associated with a modified logit function that transforms the probability for the class into a real number, wherein the modified logit function is a logarithm of odds corresponding to the probability for the class, the logarithm of odds being determined based on a distance measured between the probability for the class and a centroid of a distribution associated with the class;
generating, by the machine-learning model, a set of distance-based logit values for the utterance, wherein each distance-based logit value of the set of distance-based logit values is generated by:
determining, by a respective binary classifier of the set of binary classifiers, a respective probability that the utterance corresponds to a class associated with the respective binary classifier; and
mapping, by the respective binary classifier and based on the modified logit function, the respective probability to the distance-based logit value, wherein the mapping includes using a respective distance measured between the respective probability and a centroid of a distribution associated with the class associated with the respective binary classifier;
applying, by the machine-learning model, an enhanced activation function to the set of distance-based logit values to generate a predicted output, wherein the predicted output identifies a normalized probability predictive of whether the utterance corresponds to a particular class of the set of classes within a probability distribution, and wherein the enhanced activation function includes a learned parameter for normalizing an initial output of the enhanced activation function to determine the normalized probability; and
classifying, by the chatbot system and based on the predicted output, the utterance as being associated with the particular class.
|