Topological sorting and the ETL process

In this paper, we point out that a typical ETL process to populate a database can be thought of as an acyclic directed graph and that existing graph algorithms can thus be used to resolve the order of processing the tables in the load process. While simple databases can easily be managed manually, such methods prove very useful with big and complex data warehouses. We extend an existing sorting algorithm to provide with information on which tables can be loaded in parallel.

Download the paper here.

Thanks for your registration!

Talk @ Google Data Cloud Live: AthensMachine learning for productionGT-Conference Talk: Zhamak DehghaniStrata LondonMoving from SAP BW to Databricks - Live from the DATA+AI Summit in San FranciscoWe're at the Data+AI Summit in San Francisco!D ONE is on MediumMaking NLP easyBuilding a Data Vault using dbtvault with google BigQueryReal-Time Tracking of Swiss Covid-19 CasesData Intelligence Days - introducing a Data PlatformStart up Winji in der NZZInnovationsschub in Anwaltskanzleien durch Startup Herlock.aiHWZ Yea(h)rbook 2017 FachbeitragSave the Date: common sense18/01inside-it 9. Mai 2018Insights of the Data Vault 2.0 Bootcamp with Dan LinstedtDan Linstedt @ D ONEX-Mas: A Closer Look at StoriesAI treibt die smarte Fabrik der Zukunft voran