| CPC G06F 18/23211 (2023.01) [G06F 18/2415 (2023.01); G06F 40/166 (2020.01); G06F 40/289 (2020.01); G06F 40/35 (2020.01)] | 20 Claims |

|
1. A method, in a data processing system, for automated generation of an electronic form for an electronic messaging system, the method comprising:
obtaining historical conversation logs from the electronic messaging system, wherein the historical conversation logs comprise a plurality of communication sequences corresponding to a communication interchange between a plurality of entities, and wherein the plurality of entities are associated with a plurality of levels of hierarchy within an organization;
clustering communication sequences within the historical conversation logs according to similarity of features to thereby generate clusters of communication sequences;
for each cluster in the clusters of communication sequences:
identifying, within each cluster, sequences of normalized utterances that are repeated across communication sequences of the cluster;
categorizing each sequence, in a set of the sequences of normalized utterances, as to whether each sequence can be represented as one or more electronic forms, wherein the categorizing comprises executing at least one of a rule-based classifier computing model on patterns of characteristics of the sequence to classify the sequence as either a first class of sequence for which the one or more electronic forms may be used to gather information gathered as part of the sequence or a second class of sequence for which the one or more electronic forms may not be used, wherein the second class of sequence has a length greater than a predetermined threshold length; and
extracting, for each communication sequence in the cluster, attributes and corresponding attribute values; and
generating, for at least one of the clusters, in response to at least one sequence of the at least one cluster being categorized as the first class of sequence, one or more electronic form data structures based on the attributes and corresponding attribute values extracted for each communication sequence in the at least one cluster, wherein generating the one or more electronic form data structures comprises generating a plurality of different electronic form data structures, wherein each different electronic form data structure corresponds to a different entity of the plurality of entities, and wherein different ones of the electronic form data structures in the plurality of different electronic form data structures are associated with different stages of a workflow sequence and are presented, by the electronic messaging system, to corresponding ones of the plurality of entities in accordance with execution of a corresponding stage of the workflow sequence in an order of the workflow sequence.
|
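The categorizing and generating steps of claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation. The names `Turn`, `FormSpec`, `classify_sequence`, `generate_forms`, and the threshold value `MAX_FORM_LENGTH = 4` are hypothetical. The sketch assumes the clustering, utterance normalization, and attribute extraction steps have already produced the input. It applies the one rule the claim states (a sequence longer than the threshold falls in the second class), assigns each attribute to the entity whose turn supplied it, and takes the order in which entities first supply attributes as the workflow order.

```python
from dataclasses import dataclass, field

MAX_FORM_LENGTH = 4  # hypothetical value for the "predetermined threshold length"


@dataclass
class Turn:
    entity: str       # role of the speaker, e.g. "employee" or "manager"
    utterance: str    # normalized utterance label
    attributes: dict = field(default_factory=dict)  # attribute -> value from this turn


@dataclass
class FormSpec:
    entity: str   # entity the form is presented to
    stage: int    # position in the workflow sequence, starting at 1
    fields: dict = field(default_factory=dict)  # attribute -> list of observed values


def classify_sequence(utterances, max_length=MAX_FORM_LENGTH):
    """Single-rule classifier: sequences longer than the threshold are not form-capable."""
    return "no_form" if len(utterances) > max_length else "form"


def generate_forms(cluster):
    """Build one FormSpec per entity, staged by the entity's first attribute-bearing turn."""
    forms = {}
    for conversation in cluster:
        for turn in conversation:
            if not turn.attributes:
                continue  # turns without extracted attributes contribute no fields
            form = forms.setdefault(turn.entity, FormSpec(turn.entity, len(forms) + 1))
            for name, value in turn.attributes.items():
                form.fields.setdefault(name, []).append(value)
    return sorted(forms.values(), key=lambda f: f.stage)


# Illustrative cluster of two similar conversations (invented data).
cluster = [
    [Turn("employee", "REQUEST_LEAVE", {"start_date": "2023-05-01", "days": "3"}),
     Turn("manager", "APPROVE", {"decision": "approved"})],
    [Turn("employee", "REQUEST_LEAVE", {"start_date": "2023-06-12", "days": "1"}),
     Turn("manager", "APPROVE", {"decision": "approved"})],
]

if classify_sequence([t.utterance for t in cluster[0]]) == "form":
    for form in generate_forms(cluster):
        print(form.stage, form.entity, sorted(form.fields))
```

In this sketch each entity receives its own form data structure, and sorting by `stage` gives the order in which a messaging system could present the forms as the workflow advances. A fuller treatment would likely weigh more patterns of sequence characteristics than length alone.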