US 12,412,023 B2
Identifying and formatting headers for text content
Sagar Gollamudi, San Diego, CA (US); Vishank Bhatia, Sunnyvale, CA (US); Xu Zhong, Vermont South (AU); Thanh Long Duong, Point Cook (AU); Mark Johnson, Castle Cove (AU); Srinivasa Phani Kumar Gadde, Fremont, CA (US); and Vishal Vishnoi, Redwood City, CA (US)
Assigned to Oracle International Corporation, Redwood Shores, CA (US)
Filed by Oracle International Corporation, Redwood Shores, CA (US)
Filed on Apr. 30, 2024, as Appl. No. 18/650,928.
Application 18/650,928 is a continuation of application No. 18/334,238, filed on Jun. 13, 2023, granted, now 12,001,775.
Prior Publication US 2024/0419886 A1, Dec. 19, 2024
Int. Cl. G06F 40/103 (2020.01)
CPC G06F 40/103 (2020.01) 30 Claims
OG exemplary drawing
 
1. One or more non-transitory computer-readable media comprising computer-executable instructions that, when executed by one or more processors, cause performance of operations, comprising:
identifying, in a data corpus, a first candidate text string for header classification;
determining a first font of the first candidate text string;
classifying the first candidate text string as a first header based at least in part on a first evaluation of the first font of the first candidate text string relative to one or more additional text strings in the data corpus;
applying, to the first header, a first header tag at least in part responsive to classifying the first candidate text string as the first header;
rendering the first header for display as a heading for a first set of related text strings in the data corpus.