| CPC G06Q 50/26 (2013.01) [G06F 3/0482 (2013.01); G06F 3/04847 (2013.01); G06F 16/288 (2019.01); G06F 16/3331 (2019.01); G06F 40/205 (2020.01); G06F 40/263 (2020.01); G06F 40/30 (2020.01); G06N 5/022 (2013.01); G06N 5/04 (2013.01); G06Q 10/06375 (2013.01); G06Q 50/18 (2013.01); G06F 16/95 (2019.01); G06N 20/00 (2019.01); G06Q 2230/00 (2013.01); G06T 11/206 (2013.01); G06T 2200/24 (2013.01)] | 17 Claims |

|
1. A text analytics system for predicting whether a policy will be adopted, the system comprising:
at least one processor configured to:
scrape information from a plurality of sources on the Internet by a web crawler and an extraction bot, wherein the web crawler is configured to perform functions of finding, indexing, and fetching information from the plurality of sources on the Internet, and wherein the extraction bot is configured to perform processing on the information from the plurality of sources to generate text data, the text data being associated with comments authored by a plurality of individuals about a proposed policy;
generate a policy feature vector based on at least one of words, phrases, sentences, paragraphs, pages, metadata, linguistic patterns, syntactic parsing, or tone associated with the proposed policy;
generate a plurality of comment feature vectors based on at least one of words, phrases, sentences, paragraphs, pages, metadata, linguistic patterns, syntactic parsing, or tone associated with the text data, wherein each of the comment feature vectors is associated with at least one of the comments;
train, using a training set of the text data and an association between each comment feature vector and an output sentiment, a sentiment model for determining comment sentiments, wherein the sentiment model comprises weights computed for one or more input features of the plurality of comment feature vectors using machine learning, each weight reflecting an importance of the-one or more input features;
determine, based on application of the sentiment model to the text data, a sentiment of each comment;
train, using a training set of at least one of the plurality of comment feature vectors, the policy feature vector, and a policy outcome, a policy adoption model to determine the likelihood of adoption of the policy, wherein the policy adoption model comprises weights computed for the one or more input features using machine learning, each weight reflecting an importance of the one or more input features;
determine, based on application of an influence filter to the text data, an influence metric associated with each comment, the influence metric comprising a value indicating a level of influence the author of each comment has on whether the policy is adopted, wherein applying the influence filter includes accessing a database of individual terms associated with a heightened degree of influence, wherein inclusion of one or more of the individual terms in a comment indicates the comment has a higher impact on whether the policy is adopted than if the one or more individual terms were not included;
weigh each comment based on the level of influence using the influence metric;
apply the policy adoption model to the plurality of comment feature vectors to determine a likelihood of adoption of the policy;
determine based on an aggregate of the weighted comments, a first indicator associated with the likelihood of adoption of the policy and a second indicator associated with the weighted comments; and
transmit the indicators to a system user; and
generate a graphical user interface associated with the policy, wherein the graphical user interface includes a comment summary region including representations of each of the comments authored by the plurality of individuals about the policy, wherein each of the representations include a name of an individual author of the comment and an indicator of the sentiment of the comment.
|