US 12,079,222 B1
Enabling data portability between systems
Emily Potyraj, Houston, TX (US); Joshua Robinson, Madrid (ES); and Brian Gold, Los Altos, CA (US)
Assigned to PURE STORAGE, INC., Santa Clara, CA (US)
Filed by PURE STORAGE, INC., Mountain View, CA (US)
Filed on Jan. 28, 2021, as Appl. No. 17/160,623.
Claims priority of provisional application 63/078,012, filed on Sep. 14, 2020.
Claims priority of provisional application 63/074,821, filed on Sep. 4, 2020.
Int. Cl. G06F 16/2457 (2019.01); G06F 16/248 (2019.01); G06F 16/27 (2019.01)
CPC G06F 16/24573 (2019.01) [G06F 16/248 (2019.01); G06F 16/275 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method implemented on a multi-site deployment including a plurality of storage systems coupled to a plurality of on-premises cloud infrastructures in a plurality of different physical locations that provide cloud infrastructure with localized storage resources connected with data communications to deliver cloud-based storage services in an on-premises storage system, the method comprising:
storing data in the plurality of storage systems coupled to the plurality of on-premises cloud infrastructures in the plurality of different physical locations that deliver cloud-based storage services in the on-premises storage system, wherein a remote cloud-based metadata store implemented in a cloud-computing environment stores centralized metadata describing data stored by the plurality of on-premises cloud infrastructures;
submitting, via a first storage system coupled to an on-premises cloud infrastructure at a first physical location that accesses cloud-based storage, a metadata search query to the remote cloud-based metadata store;
based on metadata responsive to the metadata search query indicating data stored in at least one particular storage system received from the remote cloud-based metadata store, selecting a subset of data stored in at least one on-premises storage system coupled to at least one other on-premises cloud infrastructure in a second physical location; and
replicating, from the at least one on-premises storage system coupled to at least one other on-premises cloud infrastructure in the second physical location to the first storage system, a resulting subset of the data responsive to the metadata search query, providing a local dataset, at the first storage system, that includes the resulting subset of the data responsive to the metadata search query.