Multi Data Lake

Analyses with globally distributed data
Multiple Data Lakes with multiple MDMs

Within each data lake (MaDaM), analyses are processed close to the data, and the partial results are subsequently aggregated into a global result.

A global company with multiple research centers can build several data storage centers, each with its own MaDaM. Each center should be located where most of the access to its data is expected to occur. Engineers can use their ‘local’ MaDaM for most of their analysis tasks. If a global analysis is required, the local MaDaM contacts all the others and distributes both the search request and the analysis definition. Each MaDaM processes the request against its own data and sends its results back to the requesting MaDaM, where all the results are aggregated and used for the final report.
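The request flow described above follows a scatter-gather pattern, which can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the MaDaM interface: the `DataLake` class, its `handle` method, the `Partial` result type, and the sample records are all hypothetical, and the lakes are simulated in memory instead of being contacted over a network. The sketch shows why each site returns a small, mergeable partial result (count, sum, maximum) and not its raw data.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass
class Partial:
    """Mergeable partial result (hypothetical; not a MaDaM type)."""
    count: int = 0
    total: float = 0.0
    maximum: float = float("-inf")

    def merge(self, other: "Partial") -> "Partial":
        return Partial(
            self.count + other.count,
            self.total + other.total,
            max(self.maximum, other.maximum),
        )


class DataLake:
    """In-memory stand-in for one site's data lake (assumption)."""

    def __init__(self, name: str, records: list[dict]):
        self.name = name
        self.records = records

    def handle(self, search: dict, channel: str) -> Partial:
        # Run the search and the analysis locally, close to the data,
        # and return only a compact partial result.
        partial = Partial()
        for record in self.records:
            if all(record.get(key) == value for key, value in search.items()):
                value = record[channel]
                partial = partial.merge(Partial(1, value, value))
        return partial


def global_analysis(lakes: list[DataLake], search: dict, channel: str) -> Partial:
    # Scatter: send the search request and analysis definition to every lake.
    with ThreadPoolExecutor(max_workers=len(lakes)) as pool:
        partials = list(pool.map(lambda lake: lake.handle(search, channel), lakes))
    # Gather: aggregate the partial results at the requesting site.
    result = Partial()
    for partial in partials:
        result = result.merge(partial)
    return result


# Made-up sample data for three sites.
lakes = [
    DataLake("site_a", [{"vehicle": "X1", "speed": 80.0},
                        {"vehicle": "X2", "speed": 95.0}]),
    DataLake("site_b", [{"vehicle": "X1", "speed": 120.0}]),
    DataLake("site_c", [{"vehicle": "X1", "speed": 100.0},
                        {"vehicle": "X1", "speed": 60.0}]),
]

report = global_analysis(lakes, {"vehicle": "X1"}, "speed")
print(report.count, report.total / report.count, report.maximum)
```

The global mean is computed from the merged count and sum, not by averaging per-site means, so sites with different amounts of matching data are weighted correctly. Only three numbers per site cross the network in this sketch, regardless of how many records each site holds.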