CPC G06F 16/951 (2019.01) [G06F 16/957 (2019.01)] | 20 Claims |
19. A non-transitory computer readable storage medium storing instructions for extracting company-specific publicly accessible information, the storage medium comprising executable code which, when executed by a processor, causes the processor to:
receive first information that relates to an identification of at least one company;
determine, based on the first information, at least one publicly accessible data source via which second information that relates to the at least one company is available;
receive at least one user input that relates to a type of company-specific data to be accessed from the at least one publicly accessible data source;
retrieve, based on the received at least one user input, a subset of the second information,
wherein the retrieval of the subset of the second information comprises mining the at least one publicly accessible data source via a data aggregator, maximizing a bandwidth of a machine by parallelizing crawling tasks of the machine before distributing the crawling tasks to each of a plurality of machines that belong to a cluster, and using an artificial intelligence (AI)-based algorithm to extract the subset of the second information from the at least one publicly accessible data source based on the at least one user input, and
wherein the at least one publicly accessible data source comprises social media; and
output the subset of the second information.
|