Hi there! My name is Paulo and I am an analytics/data engineer and endurance runner based in Berlin. Welcome to my blog.

A simple Dockerfile for a dbt Synapse project

A sample Dockerfile to containerize a dbt project with Azure Synapse dependencies

Applying Scalers using DataFrameMapper()

An example of how to scale and one-hot encode variables while preprocessing your data frame.

Creating Sample Datasets in Python

A script to create sample pandas data frames with randomly generated data.

Working from Santa Teresa, Costa Rica

Strong waves (surf, not WiFi)

Gathering Questions about GCP

A sample of questions about GCP used as a study guide for Google's Associate Cloud Engineer certification

Mapping of Postal and District Codes

A mapping from Austrian 4-digit postal codes to Austrian 3-digit district codes

Working from Corralejo, Fuerteventura

Some lessons from my travels on the nomad island

Using Number Filters in Metabase

A workaround for slider-like filters in Metabase

Matching First Names and Gender using gender-guesser

gender-guesser is a Python package that lets you match a first name and name origin to a gender