Hi there! My name is Paulo and I am a freelance data all-arounder based in Berlin, Germany. Welcome to my blog.

A simple Dockerfile for a dbt Synapse

A sample Dockerfile to containerize a dbt project with Azure Synapse dependencies

Applying Scalers using DataFrameMapper()

An example of how to scale and one-hot encode variables while preprocessing your data frame.

Creating Sample Datasets in Python

A script to create sample pandas data frames with randomly generated data.

Working from Santa Teresa, Costa Rica

Strong waves (surf, not WiFi)

Gathering Questions about GCP

A sample of questions about GCP, used as a study guide for Google's Associate Cloud Engineer certification.

Mapping of Postal and District Codes

A mapping from Austrian 4-digit postal codes to Austrian 3-digit district codes