| CPC G16H 50/20 (2018.01) [G06F 40/205 (2020.01); G06N 20/00 (2019.01); G16H 70/60 (2018.01); G16H 10/60 (2018.01); G16H 30/20 (2018.01)] | 20 Claims |

|
1. A medical text summarization system, comprising:
a non-transitory memory device for storing computer readable program code; and
a processor device in communication with the memory device, the processor device being operative with the computer readable program code to perform steps including
receiving training histories of present illness, reference medical documents and medical text corpora,
training, based on the training histories of present illness and the reference medical documents, an extractor that selects one or more relevant sentences from the training histories of present illness, wherein the extractor comprises a reinforcement learning agent,
pre-training, based on the training histories of present illness and the reference medical documents, an abstractor that generates one or more first reasons for study from the one or more relevant sentences selected by the extractor, wherein the one or more first reasons for study comprise one or more paraphrases of the one or more relevant sentences,
pre-training an entity linking system using the medical text corpora to map one or more mentions in the one or more first reasons for study to one or more standardized entities for predicting one or more diagnoses,
re-training, based on the training histories of present illness and the reference medical documents, the reinforcement learning agent using one or more rewards generated by the entity linking system by evaluating quality of the one or more first reasons for study, and
generating one or more second reasons for study from a current history of present illness using the trained extractor, the pre-trained abstractor and the pre-trained entity linking system.
|