Topological sorting and the ETL process

In this paper, we point out that a typical ETL process to populate a database can be thought of as an acyclic directed graph and that existing graph algorithms can thus be used to resolve the order of processing the tables in the load process. While simple databases can easily be managed manually, such methods prove very useful with big and complex data warehouses. We extend an existing sorting algorithm to provide with information on which tables can be loaded in parallel.

Download the paper here.

Thanks for your registration!

Data Intelligence Days - introducing a Data PlatformD ONE is on MediumTalk @ Google Data Cloud Live: AthensX-Mas: A Closer Look at StoriesDan Linstedt @ D ONEMachine learning for productionStrata LondonData Vault 2.0 Bootcamp and certification with Dan Linstedt by D ONEinside-it 9. Mai 2018HWZ Yea(h)rbook 2017 FachbeitragInnovationsschub in Anwaltskanzleien durch Startup Herlock.aiArtificial Intelligence in Claim ManagementAI treibt die smarte Fabrik der Zukunft voranWe're at the Data+AI Summit in San Francisco!Save the Date: common sense18/01GT-Conference Talk: Zhamak DehghaniReal-Time Tracking of Swiss Covid-19 CasesBuilding a Data Vault using dbtvault with google BigQueryStart up Winji in der NZZMoving from SAP BW to Databricks - Live from the DATA+AI Summit in San Francisco