| CPC G06F 18/2148 (2023.01) [G06F 18/217 (2023.01); G06F 18/2415 (2023.01); G06F 40/117 (2020.01); G06F 40/30 (2020.01); G06V 30/413 (2022.01); G06F 16/33 (2019.01); G06V 2201/10 (2022.01)] | 20 Claims |

|
1. A method comprising:
accessing an input document;
accessing a trained segmentation-and-labeling model configured by an accompanying training parameter set to predict segment bounds and classify resulting predicted segments into one or more topics;
generating a predicted segment for the input document by applying a segmentation network of the trained segmentation-and-labeling model to an encoded text portion from the input document;
generating a topic for the predicted segment by applying a pooling network of the trained segmentation-and-labeling model to the predicted segment;
assembling, using the predicted segment and the topic, an output document including the input document, segment metadata identifying the predicted segment, and topic metadata identifying the topic;
storing or displaying the output document; and
reconfiguring one of the segmentation network or the pooling network using an updated training parameter set, while leaving the other of the segmentation network or the pooling network configured by original training parameters from the accompanying training parameter set.
|