Hi there! My name is Paulo and I am an analytics/data engineer and endurance runner based in Berlin. Welcome to my blog.

Running my First 100-miler

Running my First 100-miler

My experience running the Berlin Wall, my first 100-mile foot race

dbt run commands for model development

dbt run commands for model development

A compilation of a few commands when developing new models in dbt

Preventing Unexpected Billing in GCP

Preventing Unexpected Billing in GCP

A few tips on how to avoid surprises at the end of the month

Running Scripts using Azure Data Factory and Batch, Part II

Running Scripts using Azure Data Factory and Batch, Part II

An example on how to run scripts within an Azure Data Factory pipeline

Running Scripts using Azure Data Factory and Batch, Part I

Running Scripts using Azure Data Factory and Batch, Part I

An example on how to connect Azure Batch and Azure Storage accounts to Azure Data Factory

4 x 4 x 48

4 x 4 x 48

Going through with the running challenge popularized by David Goggins

Deriving Meaning from Date Features for Prediction in Python

Deriving Meaning from Date Features for Prediction in Python

Some ideas on new features that can be created based on date features

Saving and Loading Scalers using MLflow and Databricks with Python

Saving and Loading Scalers using MLflow and Databricks with Python

Chances are, you will be needing scalers to make predictions on new data sets

OpenStreetMap Features and Tags in R

OpenStreetMap Features and Tags in R

Some tips to find the right features and tags for the elements you are mapping in R