Topological sorting and the ETL process

In this paper, we point out that a typical ETL process to populate a database can be thought of as an acyclic directed graph and that existing graph algorithms can thus be used to resolve the order of processing the tables in the load process. While simple databases can easily be managed manually, such methods prove very useful with big and complex data warehouses. We extend an existing sorting algorithm to provide with information on which tables can be loaded in parallel.

Download the paper here.

Thanks for your registration!

Building a Data Vault using dbtvault with google BigQueryX-Mas: A Closer Look at StoriesData Vault 2.0 Bootcamp and certification with Dan Linstedt by D ONE“Information Permeability” - how you can improve information flows in your company.HWZ Yea(h)rbook 2017 FachbeitragTalk @ Google Data Cloud Live: AthensReal-Time Tracking of Swiss Covid-19 CasesGT-Conference Talk: Zhamak DehghaniDan Linstedt @ D ONEWe're at the Data+AI Summit in San Francisco!Making NLP easyData Intelligence Days - introducing a Data PlatformMoving from SAP BW to Databricks - Live from the DATA+AI Summit in San FranciscoInsights of the Data Vault 2.0 Bootcamp with Dan LinstedtAI treibt die smarte Fabrik der Zukunft voranStart up Winji in der NZZD ONE is on MediumMachine learning for productionStrata LondonArtificial Intelligence in Claim Management