Hi there! My name is Paulo and I am a freelance data all-arounder based in Berlin, Germany. Welcome to my blog.

Understanding DAGs in dbt: Threads, Errors and Failing Fast

Three scenarios show the implications of running a Directed Acyclic Graph (DAG) in multi-threaded dbt environments, shedding light on how errors and the "fail fast" argument affect model execution

Taking snapshots using dbt in Azure Synapse: The Hash Distribution Issue

Learn how to troubleshoot and resolve the "Hash Distributed Table" error when taking snapshots using dbt in Azure Synapse Dedicated Pool

Understanding DAGs in dbt: Erring Models and Failing Tests

Three scenarios show us what happens downstream when our models and/or tests do not succeed

dbt seed: Data Warehousing and CSV Files

The following are a few points I have learned while working with seeds in my dbt project

Surf, Work, and Explore: A Web Scraping Exercise

I scrape data using Python to find new countries with surf spots from which you can work remotely, ceteris paribus

Effortless Documentation with dbt: Streamlining Data Warehouse Documentation

A step-by-step approach to documenting tables and columns, leveraging YAML and Markdown files. Learn how to create comprehensive documentation, including tests, with the help of Jinja scripts. Enhance collaboration, improve understanding, and ensure the clarity of your data models with dbt's powerful documentation capabilities