US 12,001,491 B2
Method and system for automated public information discovery
Akshat Gupta, New York, NY (US); Simerjot Kaur, Jersey City, NJ (US); Xiaomo Liu, Manhasset, NY (US); Armineh Nourbakhsh, Pittsburgh, PA (US); Andrea Stefanucci, Hoboken, NJ (US); Alex Woodgate, New York, NY (US); and Sameena Shah, Scarsdale, NY (US)
Assigned to JPMORGAN CHASE BANK, N.A., New York, NY (US)
Filed by JPMorgan Chase Bank, N.A., New York, NY (US)
Filed on Feb. 1, 2022, as Appl. No. 17/649,617.
Prior Publication US 2023/0244724 A1, Aug. 3, 2023
Int. Cl. G06F 16/951 (2019.01); G06F 16/957 (2019.01)
CPC G06F 16/951 (2019.01) [G06F 16/957 (2019.01)] 20 Claims
OG exemplary drawing
 
19. A non-transitory computer readable storage medium storing instructions for extracting company-specific publicly accessible information, the storage medium comprising executable code which, when executed by a processor, causes the processor to:
receive first information that relates to an identification of at least one company;
determine, based on the first information, at least one publicly accessible data source via which second information that relates to the at least one company is available;
receive at least one user input that relates to a type of company-specific data to be accessed from the at least one publicly accessible data source;
retrieve, based on the received at least one user input, a subset of the second information,
wherein the retrieval of the subset of the second information comprises mining the at least one publicly accessible data source via a data aggregator, maximizing a bandwidth of a machine by parallelizing crawling tasks of the machine before distributing the crawling tasks to each of a plurality of machines that belong to a cluster, and using an artificial intelligence (AI)-based algorithm to extract the subset of the second information from the at least one publicly accessible data source based on the at least one user input, and
wherein the at least one publicly accessible data source comprises social media; and
output the subset of the second information.