US 12,411,869 B2
Data processing independent of storage, format or schema
Jacob Russell Gminder, Sanford, FL (US)
Assigned to HITACHI VANTARA LLC, Santa Clara, CA (US)
Appl. No. 18/036,198
Filed by HITACHI VANTARA LLC, Santa Clara, CA (US)
PCT Filed Nov. 23, 2020, PCT No. PCT/US2020/061718
§ 371(c)(1), (2) Date May 10, 2023,
PCT Pub. No. WO2022/108599, PCT Pub. Date May 27, 2022.
Prior Publication US 2023/0418842 A1, Dec. 28, 2023
Int. Cl. G06F 16/28 (2019.01); G06F 16/2457 (2019.01); G06F 16/25 (2019.01)
CPC G06F 16/285 (2019.01) [G06F 16/24573 (2019.01); G06F 16/254 (2019.01)]
13 Claims
 
1. A system comprising:
a first computing device comprising a processor configured to communicate over a network with a plurality of data cluster computing devices associated with a plurality of data clusters storing data resources, and configured to manage metadata associated with the stored data resources, the first computing device executing a single instance of an application to perform operations comprising:
obtaining parameters indicating first metadata, and second metadata in a respective data cluster;
searching the metadata using the first metadata as a search query and returning metadata as a first search result;
obtaining a first location of one or more data resources in a respective data cluster based on the returned metadata in the first search result;
reading contents of the one or more data resources from the first location of the respective data cluster;
searching the metadata using the second metadata as a search query and returning metadata as a second search result;
obtaining a second location in a respective data cluster based on the returned metadata in the second search result;
writing the read contents of the one or more data resources to the obtained second location based on the second search result;
wherein the first computing device is configured to execute a catalog service that manages the data resources, and metadata associated with each data resource, and resource identifiers identifying each data resource,
wherein the obtained parameters include a parameter indicating whether to maintain a folder structure,
wherein the data resources include a first virtual folder and a second virtual folder,
wherein the first metadata in the obtained parameters is a resource identifier of the first virtual folder,
wherein the second metadata in the obtained parameters is a name of the virtual folder, and
wherein the obtained second location is a file path of the second virtual folder determined based on the returned metadata in the second search result, the operations further comprising:
reading the contents of the one or more data resources from the first virtual folder at the first location of the respective data cluster;
determining whether the parameter indicating whether to maintain a folder structure indicates to maintain a folder structure;
upon determining the parameter indicating whether to maintain a folder structure indicates to maintain a folder structure, writing the read contents to the obtained second location of the second virtual folder according to a folder structure of the first folder; and
upon determining the parameter indicating whether to maintain a folder structure does not indicate to maintain a folder structure, writing the read contents to the obtained second location of the second virtual folder without maintaining the folder structure of the first folder.
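
The operations recited in claim 1 can be illustrated with a minimal, non-authoritative sketch. It is not the patented implementation. Everything named here is an assumption made for illustration: the `CatalogEntry` and `Catalog` classes, the `copy_virtual_folder` function, the representation of each data cluster as an in-memory dictionary mapping file paths to bytes, and the choice to "not maintain" the folder structure by writing every file directly under the target folder.

```python
import posixpath
from dataclasses import dataclass


@dataclass(frozen=True)
class CatalogEntry:
    """Hypothetical metadata record kept by the catalog service."""
    resource_id: str  # resource identifier of the data resource
    name: str         # human-readable name of the virtual folder
    cluster: str      # data cluster that stores the resource
    path: str         # file path of the resource inside that cluster


class Catalog:
    """Hypothetical catalog service: searches metadata only, never file contents."""

    def __init__(self, entries):
        self._entries = list(entries)

    def search(self, **query):
        # Return every entry whose metadata fields match the query exactly.
        return [e for e in self._entries
                if all(getattr(e, key) == value for key, value in query.items())]


def copy_virtual_folder(catalog, clusters, source_id, target_name, maintain_structure):
    """Copy a virtual folder's contents to another virtual folder.

    `clusters` maps a cluster name to a dict of {file path: bytes}
    (an assumed stand-in for real storage back ends).
    """
    # First search: resource identifier -> first location.
    source = catalog.search(resource_id=source_id)[0]
    # Second search: folder name -> second location.
    target = catalog.search(name=target_name)[0]

    source_root = source.path.rstrip("/") + "/"
    written = []
    for path, data in list(clusters[source.cluster].items()):
        if not path.startswith(source_root):
            continue  # not inside the first virtual folder
        relative = path[len(source_root):]
        if not maintain_structure:
            # Assumed flattening rule: keep only the file name.
            # Files with equal names would overwrite each other here.
            relative = posixpath.basename(relative)
        destination = posixpath.join(target.path, relative)
        clusters[target.cluster][destination] = data
        written.append(destination)
    return sorted(written)
```

In this sketch the caller supplies only metadata (a resource identifier and a folder name); the storage locations on both sides are resolved through the catalog search, which mirrors how the claim keeps the operation independent of where the data is stored.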