US 11,893,044 B2
Recognizing unknown data objects
Mehul A Shah, Saratoga, CA (US); George Steven McPherson, Seattle, WA (US); Prajakta Datta Damle, San Jose, CA (US); Gopinath Duddi, San Jose, CA (US); and Anurag Windlass Gupta, Atherton, CA (US)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Apr. 10, 2020, as Appl. No. 16/846,141.
Application 16/846,141 is a continuation of application No. 15/385,772, filed on Dec. 20, 2016, granted, now 10,621,210.
Claims priority of provisional application 62/426,573, filed on Nov. 27, 2016.
Prior Publication US 2020/0242135 A1, Jul. 30, 2020
Int. Cl. G06F 17/30 (2006.01); G06F 16/28 (2019.01); G06F 16/21 (2019.01); G06F 16/13 (2019.01); G06F 16/25 (2019.01)
CPC G06F 16/285 (2019.01) [G06F 16/13 (2019.01); G06F 16/211 (2019.01); G06F 16/254 (2019.01); G06F 16/289 (2019.01)]
20 Claims
1. A system, comprising:
at least one processor; and
a memory, that stores program instructions that when executed by the at least one processor, causes the at least one processor to implement a data catalog service, the data catalog service configured to:
receive, via an interface for a data catalog service, a classifier for a data schema;
determine, by the data catalog service, a storage location in a data store of an unknown data object based on a comparison of a list of one or more known objects with contents found in the storage location of the data store;
perform a recognition task by the data catalog service for the unknown data object that is stored in the data store, wherein to perform the recognition task, the data catalog service is configured to:
apply the classifier to generate a representation of a portion of the unknown data object that identifies the data schema of a plurality of possible data schemas as the data schema configured to obtain access to the entire unknown data object; and
store the identification of the data schema for the unknown data object in the data catalog service.
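
The flow recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. It assumes the data store is an in-memory dict mapping storage locations to bytes, the catalog is a plain dict, and a classifier is a function that inspects only a leading portion of an object. All names below (`find_unknown_objects`, `recognize`, `json_classifier`, `csv_classifier`, `sample_size`) are hypothetical and do not come from the patent or from any Amazon API.

```python
import json
from typing import Callable, Dict, Iterable, List, Optional

# Assumption: a classifier looks at a leading portion of an object and
# returns the name of the schema it recognizes, or None if it does not match.
Classifier = Callable[[bytes], Optional[str]]


def json_classifier(head: bytes) -> Optional[str]:
    """Hypothetical classifier: matches if the first line is a JSON object."""
    try:
        first_line = head.decode("utf-8").splitlines()[0]
        record = json.loads(first_line)
    except (UnicodeDecodeError, IndexError, ValueError):
        return None
    return "json_lines" if isinstance(record, dict) else None


def csv_classifier(head: bytes) -> Optional[str]:
    """Hypothetical classifier: matches if the first two lines have the
    same non-zero number of commas."""
    try:
        lines = head.decode("utf-8").splitlines()
    except UnicodeDecodeError:
        return None
    if len(lines) < 2:
        return None
    first, second = lines[0].count(","), lines[1].count(",")
    return "csv" if first > 0 and first == second else None


def find_unknown_objects(store: Dict[str, bytes], known: Iterable[str]) -> List[str]:
    """Compare the list of known objects with the store's contents and
    return the storage locations that are not yet known."""
    known_set = set(known)
    return sorted(location for location in store if location not in known_set)


def recognize(
    store: Dict[str, bytes],
    known: Iterable[str],
    classifiers: List[Classifier],
    catalog: Dict[str, str],
    sample_size: int = 256,
) -> Dict[str, str]:
    """Apply each classifier to a portion of every unknown object and record
    the first schema identified for it in the catalog."""
    for location in find_unknown_objects(store, known):
        head = store[location][:sample_size]  # only a portion is read
        for classify in classifiers:
            schema = classify(head)
            if schema is not None:
                catalog[location] = schema
                break
    return catalog


store = {
    "a/known.csv": b"x,y\n1,2\n",
    "a/new.json": b'{"id": 1}\n{"id": 2}\n',
    "a/new.csv": b"id,name\n1,ann\n2,bob\n",
    "a/blob.bin": b"\xff\xfe\x00",
}
catalog = recognize(store, ["a/known.csv"], [json_classifier, csv_classifier], {})
```

In this sketch the already-known object is skipped, the two new text objects receive a schema entry, and the binary object is left uncataloged because no classifier matches it. The classifiers are tried in list order, so a more specific classifier should be placed before a more permissive one.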