| CPC G06F 11/3616 (2013.01) [G06F 8/60 (2013.01); G06F 11/3006 (2013.01); G06F 16/3344 (2019.01); G06F 16/93 (2019.01); G06F 16/951 (2019.01); G06F 40/35 (2020.01)] | 20 Claims |

|
1. A computer-implemented method for source knowledge generation, the method comprising:
identifying one or more external systems storing API (Application Programming Interface) documentation for accessing one or more data source systems;
crawling the one or more external systems to extract a plurality of documents, each document representing information describing APIs for accessing the one or more data source systems;
generating a plurality of vector representations by, for each document of the plurality of documents, providing the document to a neural network to generate a vector representation of the document;
generating an index that maps each vector representation to its respective document;
extracting information describing one or more aspects for a data connector by querying the index, receiving an indication of a set of matching documents, and extracting the information describing the data connector from the set of matching documents, the query corresponding to a particular type of data source; and
generating a connector representation based on the extracted information, wherein the connector representation is used for generating and deploying a new data connector for accessing data from a data source system of the particular type of data source.
|
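For illustration only, the steps recited in claim 1 can be sketched in Python as below. This is a minimal sketch under stated assumptions, not the claimed implementation: the crawling step is replaced by two hard-coded, hypothetical documents; the neural network is replaced by a hashed bag-of-words embedding so that the example runs with the standard library alone; and every name (`embed`, `build_index`, `query_index`, `extract_connector_info`, `connector_representation`, the `example` URLs) is invented for the example. Generating and deploying the connector itself is out of scope here.

```python
import hashlib
import math
import re

DIM = 1024  # embedding width; an arbitrary choice for this sketch


def embed(text):
    """Stand-in for the neural network: hashed bag-of-words, L2-normalised."""
    vec = [0.0] * DIM
    for token in re.findall(r"[a-z0-9]+", text.lower()):
        bucket = int(hashlib.sha256(token.encode()).hexdigest(), 16) % DIM
        vec[bucket] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def build_index(documents):
    """Pair each vector representation with the document it came from."""
    return [(embed(doc), doc) for doc in documents]


def query_index(index, query, top_k=1):
    """Return the top_k documents ranked by cosine similarity to the query."""
    q = embed(query)
    ranked = sorted(
        index,
        key=lambda entry: sum(a * b for a, b in zip(q, entry[0])),
        reverse=True,
    )
    return [doc for _, doc in ranked[:top_k]]


def extract_connector_info(docs):
    """Pull a base URL and an auth scheme out of the matching documents."""
    info = {}
    for doc in docs:
        url = re.search(r"https?://\S+", doc)
        auth = re.search(r"authentication:\s*(\w+)", doc, re.IGNORECASE)
        if url and "base_url" not in info:
            info["base_url"] = url.group(0)
        if auth and "auth" not in info:
            info["auth"] = auth.group(1)
    return info


def connector_representation(source_type, info):
    """Assemble a simple dictionary describing the connector."""
    return {"source_type": source_type, **info}


# Hypothetical documents standing in for the output of the crawling step.
DOCS = [
    "Acme CRM REST API. Base URL https://api.acme.example/v2 "
    "Authentication: oauth2. Endpoints list contacts and deals.",
    "Widget Warehouse SQL gateway. Base URL https://sql.widget.example/query "
    "Authentication: apikey. Run inventory queries.",
]

index = build_index(DOCS)
matches = query_index(index, "acme crm contacts")
rep = connector_representation("crm", extract_connector_info(matches))
print(rep)
```

In a fuller system, the list-based index would presumably give way to a vector store and the regular expressions to a more robust extraction step; the sketch only shows how the claimed steps feed into one another.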