US 11,989,511 B2
System and method for term disambiguation
Daniel Riskin, Menlo Park, CA (US); and Anand Shroff, San Carlos, CA (US)
Assigned to Verantos, Inc., Menlo Park, CA (US)
Filed by Verantos, Inc., Menlo Park, CA (US)
Filed on Jun. 27, 2023, as Appl. No. 18/341,896.
Application 18/341,896 is a continuation of application No. 17/959,099, filed on Oct. 3, 2022, granted, now 11,727,208.
Application 17/959,099 is a continuation of application No. 17/581,498, filed on Jan. 21, 2022, granted, now 11,494,557, issued on Nov. 8, 2022.
Claims priority of provisional application 63/189,340, filed on May 17, 2021.
Prior Publication US 2023/0342548 A1, Oct. 26, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 40/247 (2020.01); G06F 40/40 (2020.01); G06N 5/022 (2023.01)
CPC G06F 40/247 (2020.01) [G06F 40/40 (2020.01); G06N 5/022 (2013.01)] 7 Claims
OG exemplary drawing
 
1. A computer-implemented method, comprising:
identifying a plurality of initial concept groups from medical records or medical literature, wherein each initial concept group comprises clinical concepts and potential meanings of the clinical concepts;
constructing a knowledge set, using machine learning, the plurality of initial concept groups, wherein the constructing comprises:
(i) evaluating associations between the clinical concepts and the potential meanings of the clinical concepts, the associations comprising:
(a) a strength between each clinical concept and corresponding potential meanings weighted based on a token distance between occurrences of the associated concept and each potential meaning,
(b) a frequency of occurrence of each potential meaning of the clinical concepts, and
(c) directionality between associated clinical concepts, wherein the directionality indicates whether the occurrence of one clinical concept more likely leads to the occurrence of the other clinical concept than the reverse; and
(ii) filtering the plurality of initial concept groups based on the evaluation; and
storing the knowledge set into a database for deployment in disambiguating terms in an information source.