US 11,983,502 B2
Extracting fine-grained topics from text content
Deven Santosh Shah, San Jose, CA (US); Sukanya Moorthy, Sunnyvale, CA (US); and Topojoy Biswas, San Jose, CA (US)
Assigned to YAHOO AD TECH LLC, New York, NY (US)
Filed by YAHOO AD TECH LLC, Dulles, VA (US)
Filed on Nov. 24, 2021, as Appl. No. 17/534,502.
Prior Publication US 2023/0161964 A1, May 25, 2023
Int. Cl. G06F 40/284 (2020.01); G06F 40/166 (2020.01); G06F 40/30 (2020.01); G06F 40/40 (2020.01); G06N 3/08 (2023.01); G06F 40/242 (2020.01)
CPC G06F 40/30 (2020.01) [G06F 40/166 (2020.01); G06F 40/40 (2020.01); G06N 3/08 (2013.01); G06F 40/242 (2020.01); G06F 40/284 (2020.01)] 20 Claims
OG exemplary drawing
 
1. A method comprising:
generating a set of sentences based on a document;
predicting a set of labels for each sentence using a multi-label classifier, the multi-label classifier including a self-attended contextual word embedding backbone layer, a bank of trainable unigram convolutions, a bank of trainable bigram convolutions, and a fully connected layer, the multi-label classifier trained using a weakly labeled data set; and
labeling the document based on the set of labels.