Hi there! My name is Paulo and I am an analytics/data engineer and endurance runner based in Berlin. Welcome to my blog.

Taking snapshots using dbt in Azure Synapse: The Hash Distribution Issue

Taking snapshots using dbt in Azure Synapse: The Hash Distribution Issue

Learn how to troubleshoot and resolve the "Hash Distributed Table" error when taking snapshots using dbt in Azure Synapse Dedicated Pool

Understanding DAGs in dbt: Erring Models and Failing Tests

Understanding DAGs in dbt: Erring Models and Failing Tests

Three scenarios shows us what happens downstream when our models and/or tests do not succeed

dbt seed: Data Warehousing and CSV Files

dbt seed: Data Warehousing and CSV Files

The following are a few points I have learned while working with seeds in my dbt project

Surf, Work, and Explore: A Web Scraping Exercise

Surf, Work, and Explore: A Web Scraping Exercise

I scrape data using python to find new countries with surf spots where you can work remotely from, ceteris paribus

Effortless Documentation with dbt: Streamlining Data Warehouse Documentation

Effortless Documentation with dbt: Streamlining Data Warehouse Documentation

A step-by-step approach to documenting tables and columns, leveraging YAML and Markdown files. Learn how to create comprehensive documentation, including tests, with the help of Jinja scripts. Enhance collaboration, improve understanding, and ensure the clarity of your data models with dbt's powerful documentation capabilities

My Airflow Journey

My Airflow Journey

Starting from scratch is impractical, Google Cloud Platform's Composer is expensive, and debugging Airflow on Windows is challenging

Linking to BigQuery in Azure Data Factory

Linking to BigQuery in Azure Data Factory

In this article, we go over the steps, requirements and recommendations when creating a BigQuery linked service in Azure Data Factory

Nudging The End User

Nudging The End User

In PBI, you can nudge the user towards selecting the data they need first, before it is actually displayed in the report

Is My dbt Ready?

Is My dbt Ready?

Set up activities in Azure Data Factory to track when your containerized dbt project has finished running