US 11,886,429 B2
Persistent metadata catalog
George Steven McPherson, Seattle, WA (US); Yevgeniy Mikhaylyuta, Sammamish, WA (US); Tobin Daniel Baker, Everett, WA (US); and Richard Jeffrey Cole, Seattle, WA (US)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Reno, NV (US)
Filed on Nov. 25, 2020, as Appl. No. 17/104,392.
Application 17/104,392 is a continuation of application No. 14/310,422, filed on Jun. 20, 2014, granted, now 10,853,356.
Prior Publication US 2021/0173828 A1, Jun. 10, 2021
Int. Cl. G06F 21/62 (2013.01); G06F 16/24 (2019.01)
CPC G06F 16/24 (2019.01) [G06F 21/6218 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A system, comprising:
one or more computing devices configured to implement:
a data catalog comprising a central repository accessible to one or more data producers and one or more data consumers, the data catalog configured to:
store metadata regarding a plurality of data sources configured to store data, wherein the metadata includes a description, a location, a schema, and access information for accessing the data stored at respective ones of the plurality of data sources;
receive, from a data producer of the one or more data producers, a request to add new metadata of a new data source to the plurality of data sources of the data catalog;
add the new metadata for the new data source to the data catalog according to the request to add the new metadata, wherein the new metadata comprises information indicating a description, a location, and a schema for data stored at the new data source;
receive, from a data consumer of the one or more data consumers, a request to discover at least one data source of the plurality of data sources;
in response to the request to discover the at least one data source, provide the metadata for the at least one data source from the data catalog to the data consumer, wherein the data consumer is configured to connect to the at least one data source according to the provided metadata, and wherein the metadata provided from the data catalog indicates a location of a loadable software module configured to execute at the data consumer and access the at least one data source on behalf of the data consumer; and
provide the loadable software module to the data consumer.