Taking snapshots using dbt in Azure Synapse: The Hash Distribution Issue
Learn how to troubleshoot and resolve the "Hash Distributed Table" error when taking snapshots using dbt in Azure Synapse Dedicated Pool
When I am not building data pipelines, you will find me training for my next endurance race. I write about both.
Currently preparing for Spartathlon 2026 and documenting the journey in my 246 series.
Three scenarios show us what happens downstream when our models and/or tests do not succeed
The following are a few points I have learned while working with seeds in my dbt project
I scrape data using Python to find new countries with surf spots from which you can work remotely, ceteris paribus
A step-by-step approach to documenting tables and columns, leveraging YAML and Markdown files. Learn how to create comprehensive documentation, including tests, with the help of Jinja scripts. Enhance collaboration, improve understanding, and ensure the clarity of your data models with dbt's powerful documentation capabilities
Starting from scratch is impractical, Google Cloud Platform's Composer is expensive, and debugging Airflow on Windows is challenging
In this article, we go over the steps, requirements and recommendations when creating a BigQuery linked service in Azure Data Factory
In Power BI, you can nudge the user towards selecting the data they need first, before it is actually displayed in the report
Set up activities in Azure Data Factory to track when your containerized dbt project has finished running