US 11,960,508 B2
Data stitching across federated data lakes
Raghu Rajendra Arur, Tumkur (IN)
Assigned to CISCO TECHNOLOGY, INC., San Jose, CA (US)
Filed by Cisco Technology, Inc., San Jose, CA (US)
Filed on Jan. 25, 2022, as Appl. No. 17/583,601.
Prior Publication US 2023/0237070 A1, Jul. 27, 2023
Int. Cl. G06F 16/28 (2019.01); G06F 16/242 (2019.01); G06F 16/2455 (2019.01); G06F 16/248 (2019.01)
CPC G06F 16/283 (2019.01) [G06F 16/244 (2019.01); G06F 16/2456 (2019.01); G06F 16/248 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method, comprising:
receiving, at a device in communication with a plurality of data lake sites, a federated data lake query;
determining, by the device and based on the federated data lake query, a plurality of data lake operator sets that each correspond to one of the plurality of data lake sites, wherein each of the plurality of data lake operator sets is used to establish a respective data pipeline for the federated data lake query;
selecting, by the device, a particular data lake site of the plurality of data lake sites as a destination for one or more data pipelines that are established for the federated data lake query; and
sending, by the device, the plurality of data lake operator sets that each correspond to one of the plurality of data lake sites to cause the plurality of data lake sites to send query results to the particular data lake site using the one or more data pipelines, wherein the particular data lake site is configured to stitch the query results for the federated data lake query.