CPC G06F 16/345 (2019.01) [G06F 40/166 (2020.01); G06N 20/00 (2019.01); G06F 40/117 (2020.01); G06F 40/279 (2020.01)] | 20 Claims |
1. A method for abstractive summarization of a document, the method comprising:
receiving, via a data interface, a training dataset comprising a plurality of articles and a plurality of summaries corresponding to the plurality of articles;
generating a plurality of article-summary pairs by pairing each article with at least one associated summary;
computing, for an article-summary pair, an entity coverage precision metric based on a number of entity mentions in a corresponding summary or a corresponding article;
determining a pseudo label indicating a faithfulness level of the corresponding article and the corresponding summary based on the computed entity coverage precision metric;
prepending the article with the determined pseudo label as a training input to a summarization model;
generating, by the summarization model, an output summary conditioned on both the article and the prepended pseudo label; and
updating the summarization model based on a training objective comparing the output summary and the corresponding summary.
|
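The data-preparation steps of claim 1 (computing an entity coverage precision metric, deriving a pseudo label, and prepending it to the article) can be illustrated with a minimal sketch. The claim does not fix a formula, an entity extractor, or a label vocabulary, so everything below is an assumption made for illustration: precision is taken as the fraction of entity mentions in the summary that also appear in the article, a capitalized-word regular expression stands in for a real named-entity recognizer, and the function names, the 0.75 threshold, and the `<faithful-high>` / `<faithful-low>` tokens are hypothetical. The generation and model-update steps are not shown.

```python
import re

# Assumption: a crude stand-in for a named-entity recognizer. It treats runs of
# capitalized words as entity mentions; a real system would use an NER model.
ENTITY_PATTERN = re.compile(r"\b[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*\b")


def extract_entities(text):
    """Return the list of entity mentions found in the text."""
    return ENTITY_PATTERN.findall(text)


def entity_coverage_precision(article, summary):
    """Fraction of summary entity mentions that also occur in the article.

    Assumption: this is one plausible reading of the claimed metric.
    """
    summary_entities = extract_entities(summary)
    if not summary_entities:
        # Assumption: a summary with no entities has nothing unsupported.
        return 1.0
    article_entities = set(extract_entities(article))
    supported = sum(1 for e in summary_entities if e in article_entities)
    return supported / len(summary_entities)


def pseudo_label(precision, threshold=0.75):
    """Map the metric to a hypothetical two-level faithfulness token."""
    return "<faithful-high>" if precision >= threshold else "<faithful-low>"


def build_training_input(article, summary):
    """Prepend the pseudo label to the article to form the model input."""
    label = pseudo_label(entity_coverage_precision(article, summary))
    return f"{label} {article}"


if __name__ == "__main__":
    article = "Alice Smith met Bob in Paris."
    summary = "Alice Smith visited Berlin."
    print(entity_coverage_precision(article, summary))
    print(build_training_input(article, summary))
```

Under these assumptions, a summary that mentions an entity absent from the article lowers the metric and yields the low-faithfulness token. At inference time a model trained this way could presumably be conditioned on the high-faithfulness token, although the claim itself only recites the training procedure.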