US 12,411,869 B2
Data processing independent of storage, format or schema
Jacob Russell Gminder, Sanford, FL (US)
Assigned to HITACHI VANTARA LLC, Santa Clara, CA (US)
Appl. No. 18/036,198
Filed by HITACHI VANTARA LLC, Santa Clara, CA (US)
PCT Filed Nov. 23, 2020, PCT No. PCT/US2020/061718 § 371(c)(1), (2) Date May 10, 2023, PCT Pub. No. WO2022/108599, PCT Pub. Date May 27, 2022.
Prior Publication US 2023/0418842 A1, Dec. 28, 2023
Int. Cl. G06F 16/28 (2019.01); G06F 16/2457 (2019.01); G06F 16/25 (2019.01)

CPC G06F 16/285 (2019.01) [G06F 16/24573 (2019.01); G06F 16/254 (2019.01)]

13 Claims

1. A system comprising:

a first computing device comprising a processor configured to communicate over a network with a plurality of data cluster computing devices associated with a plurality of data clusters storing data resources, and configured to manage metadata associated with the stored data resources, the first computing device executing a single instance of an application to perform operations comprising:

obtaining parameters indicating first metadata, and second metadata in a respective data cluster;

searching the metadata using the first metadata as a search query and returning metadata as a first search result;

obtaining a first location of one or more data resources in a respective data cluster based on the returned metadata in the first search result;

reading contents of the one or more data resources from the first location of the respective data cluster;

searching the metadata using the second metadata as a search query and returning metadata as a second search result;

obtaining a second location in a respective data cluster based on the returned metadata in the second search result;

writing the read contents of the one or more data resources to the obtained second location based on the second search result;

wherein the first computing device is configured to execute a catalog service that manages the data resources, and metadata associated with each data resource, and resource identifiers identifying each data resource,

wherein the obtained parameters include a parameter indicating whether to maintain a folder structure,

wherein the data resources include a first virtual folder and a second virtual folder,

wherein the first metadata in the obtained parameters is a resource identifier of the first virtual folder,

wherein the second metadata in the obtained parameters is a name of the virtual folder, and

wherein the obtained second location is a file path of the second virtual folder determined based on the returned metadata in the second search result, the operations further comprising:

reading the contents of the one or more data resources from the first virtual folder at the first location of the respective data cluster;

determining whether the parameter indicating whether to maintain a folder structure indicates to maintain a folder structure;

upon determining the parameter indicating whether to maintain a folder structure indicates to maintain a folder structure, writing the read contents to the obtained second location of the second virtual folder according to a folder structure of the first folder; and

upon determining the parameter indicating whether to maintain a folder structure does not indicate to maintain a folder structure, writing the read contents to the obtained second location of the second virtual folder without maintaining the folder structure of the first folder.
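
The operations recited in claim 1 can be illustrated with a minimal sketch. This is not the patented implementation; it only mirrors the claimed sequence (two metadata searches, a read, and a write that either maintains or flattens the folder structure). Every name in it is hypothetical: `CatalogEntry`, `Catalog`, `copy_folder`, and the modeling of each data cluster as an in-memory dictionary of path to bytes are assumptions made for illustration only.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath


@dataclass(frozen=True)
class CatalogEntry:
    """Hypothetical metadata record for one virtual folder."""
    resource_id: str
    name: str
    cluster: str
    path: str


class Catalog:
    """Hypothetical in-memory stand-in for the catalog service."""

    def __init__(self, entries):
        self._entries = list(entries)

    def search(self, **query):
        # Return every entry whose metadata matches all query fields.
        return [e for e in self._entries
                if all(getattr(e, k) == v for k, v in query.items())]


def copy_folder(catalog, clusters, source_id, target_name, maintain_structure):
    """Copy a virtual folder's contents to another virtual folder.

    `clusters` maps a cluster name to a dict of {file path: bytes}
    (an assumption; a real cluster would sit behind a storage API).
    """
    # First search: resource identifier -> first location.
    src = catalog.search(resource_id=source_id)[0]
    # Second search: folder name -> second location.
    dst = catalog.search(name=target_name)[0]

    src_root = PurePosixPath(src.path)
    dst_root = PurePosixPath(dst.path)
    src_store = clusters[src.cluster]
    dst_store = clusters[dst.cluster]

    written = []
    # sorted() snapshots the items, so writing is safe even when the
    # source and destination are the same cluster.
    for path, data in sorted(src_store.items()):
        file_path = PurePosixPath(path)
        if src_root not in file_path.parents:
            continue  # not inside the first virtual folder
        relative = file_path.relative_to(src_root)
        if maintain_structure:
            target = dst_root / relative       # keep subfolders
        else:
            target = dst_root / relative.name  # flatten; same names collide
        dst_store[str(target)] = data
        written.append(str(target))
    return written
```

Calling `copy_folder` with `maintain_structure=True` reproduces the source subfolders under the destination path, while `False` writes every file directly into the destination folder. In the flattened case, files that share a name in different subfolders overwrite one another in this sketch; the claim does not say how such collisions are handled.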