US 11,734,329 B2
System and method for text categorization and sentiment analysis
Jonathan Kershaw, Penrith (GB); Ashley Unitt, Basingstoke (GB); and Alan McCord, Wakatipu Queenstown (NZ)
Assigned to VONAGE BUSINESS LIMITED, Basingstoke (GB)
Filed by NewVoiceMedia Ltd., Basingstoke (GB)
Filed on Feb. 18, 2020, as Appl. No. 16/794,162.
Application 16/794,162 is a continuation of application No. 16/283,447, filed on Feb. 22, 2019, granted, now 10,565,244.
Application 16/283,447 is a continuation in part of application No. 16/163,482, filed on Oct. 17, 2018, granted, now 10,311,454, issued on Jun. 4, 2019.
Application 16/163,482 is a continuation in part of application No. 15/675,420, filed on Aug. 11, 2017, granted, now 10,162,844, issued on Dec. 25, 2018.
Claims priority of provisional application 62/523,733, filed on Jun. 22, 2017.
Prior Publication US 2020/0265076 A1, Aug. 20, 2020
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 7/00 (2006.01); G06F 16/35 (2019.01); G06F 16/34 (2019.01); G06F 40/30 (2020.01)
CPC G06F 16/353 (2019.01) [G06F 16/34 (2019.01); G06F 40/30 (2020.01)] 8 Claims
OG exemplary drawing
 
1. A system for categorization and sentiment analysis, comprising:
a chunk parser for
receiving input text; and
breaking the input text into chunks of text comprising words and phrases;
a chunk sentiment analyzer which receives the chunks of text from the chunk parser, assigns a sentiment to each chunk of text, and passes each chunk with its assigned sentiment to a deterministic rules engine;
the deterministic rules engine which:
categorizes each chunk of text into a first set of semantic categories using regular expression rules; and
for chunks of text where no regular expression rule is found for categorization into the first set of semantic categories, passes those chunks of text to a semantic similarity engine;
the semantic similarity engine which:
adds a vector to each chunk of text received from the deterministic rules engine representing the semantic characteristics of that chunk of text;
categorizes the chunks of text into a second set of semantic categories based on a threshold semantic distance from one or more category anchor vectors; and
for chunks of text where no match is found for categorization into the second set of semantic categories, passes those chunks of text to a semantic cluster discovery engine;
a semantic cluster discovery engine for categorizing chunks of text received from the semantic similarity engine into a third set of semantic categories based on their clustering relative to one another, for those chunks of text which do not fall within the threshold distance from any of the one or more category anchor vectors; and
a category comparator and integrator for
comparing the first, second, and third sets of semantic categories to identify contextual associations between the chunks of text in each semantic category; and
calculating a sentiment for the input text based on the contextual associations.