US 11,940,996 B2
Unsupervised discriminative facet generation for dynamic faceted search
Md Faisal Mahbub Chowdhury, Woodside, NY (US)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by International Business Machines Corporation, Armonk, NY (US)
Filed on Dec. 26, 2020, as Appl. No. 17/134,271.
Prior Publication US 2022/0207030 A1, Jun. 30, 2022
Int. Cl. G06F 16/242 (2019.01); G06N 20/00 (2019.01)
CPC G06F 16/2423 (2019.01) [G06N 20/00 (2019.01)] 18 Claims
OG exemplary drawing
 
1. A method comprising:
with a computerized search engine, retrieving a plurality of electronic documents relevant to a query;
obtaining, via computerized term embedding, from said retrieved documents, a plurality of most similar terms with respect to said query;
for each of said most similar terms, determining a pervasiveness score and a relevance score;
filtering out, from said most similar terms, those of said terms that are pervasive, based on said pervasiveness score, the pervasiveness score being a probability of a given term appearing in a document related to the given term in the plurality of electronic documents relevant to the query, those of said terms that are irrelevant, based on said relevance score, the relevance score for the given term being a measure of popularity of the given term computed using a rank of documents where the given term is observed in the plurality of electronic documents relevant to the query, and those of said terms that are redundant to improve a relevance of query terms and reduce a quantity of the query terms;
outputting a top number of terms remaining in said most similar terms after said filtering, based on similarity to said query, as discriminative facets; and
with said computerized search engine, retrieving an updated plurality of electronic documents relevant to said query by resubmitting said query with at least one of said discriminative facets.