CPC G10L 15/063 (2013.01) [G10L 15/05 (2013.01); G10L 15/183 (2013.01); G06N 20/00 (2019.01); G10L 2015/0631 (2013.01)] | 20 Claims |
1. A method of unsupervised dialogue structure extraction, the method comprising:
receiving a training corpus containing at least one dialogue which includes a plurality of conversational turns, at least one conversational turn including a system response and a user utterance, the user utterance including a plurality of tokens;
determining, based on a classification model, one or more contiguous spans over one or more subsets of the plurality of tokens;
generating token span encodings in a representation space based on the one or more contiguous spans and a context including the user utterance;
assigning the token span encodings into separate slot groups based on a clustering of the token span encodings in the representation space and a pre-defined number of slot groups;
generating a dialogue structure based on a sequence of states corresponding to predicted tags associated with the slot groups; and
incorporating the dialogue structure with the at least one dialogue as training data for an intelligent dialogue agent.
|