Linking to BigQuery in Azure Data Factory
In this article, we go over the steps, requirements and recommendations when creating a BigQuery linked service in Azure Data Factory
Set up activities in Azure Data Factory to track when your containerized dbt project has finished running
Stay on top of errors that may come up when running your dbt build
A description of how to include Azure Container Instance logs in your Data Factory pipeline.
An example of how to run scripts within an Azure Data Factory pipeline
An example of how to connect Azure Batch and Azure Storage accounts to Azure Data Factory
Testing expressions is a simple trick in Azure Data Factory that lets you see what is being passed on to your conditional activities.
Starting from scratch is impractical, Google Cloud Platform's Composer is expensive, and debugging Airflow on Windows is challenging
And thus starting your CI/CD pipeline
Use custom macros to remove deprecated tables and views from your target schema
A custom macro to create date dimensions when using the dbt-synapse adapter
A work-around to the default settings when creating tables using the dbt-synapse adapter
Learn how to troubleshoot and resolve the "Hash Distributed Table" error when taking snapshots using dbt in Azure Synapse Dedicated Pool
Set up activities in Azure Data Factory to track when your containerized dbt project has finished running
Stay on top of errors that may come up when running your dbt build
A description of how to include Azure Container Instance logs in your Data Factory pipeline.
A sample Dockerfile to containerize a dbt project with Azure Synapse dependencies
A summary of three ideas on pausing dbt resources: leveraging the exclusion flag, disabling resources, and transforming models into analyses
A brief introduction to project variables and their benefit when modeling in dbt
Three scenarios show us the implications of running a Directed Acyclic Graph (DAG) in multi-threaded dbt environments, shedding light on how errors and the "fail fast" argument impact model execution
Stay on top of errors that may come up when running your dbt build
Chances are, you will be needing scalers to make predictions on new data sets
Berlin Marathon results among runners from 1974 to 2023, including data from Kaggle for the years 1974-2019 and additional results from 2021-2023
A mapping from Austrian 4-digit postal codes to Austrian 3-digit district codes
A brief description of features from CSV Rainbow and dbt Power User, both extensions for VS Code
Use custom macros to remove deprecated tables and views from your target schema
A summary of three ideas on pausing dbt resources: leveraging the exclusion flag, disabling resources, and transforming models into analyses
A custom macro to create date dimensions when using the dbt-synapse adapter
A brief introduction to exposures and why they should be in every dbt project
A brief introduction to project variables and their benefit when modeling in dbt
A work-around to the default settings when creating tables using the dbt-synapse adapter
Three scenarios show us the implications of running a Directed Acyclic Graph (DAG) in multi-threaded dbt environments, shedding light on how errors and the "fail fast" argument impact model execution
Learn how to troubleshoot and resolve the "Hash Distributed Table" error when taking snapshots using dbt in Azure Synapse Dedicated Pool
Three scenarios show us what happens downstream when our models and/or tests do not succeed
The following are a few points I have learned while working with seeds in my dbt project
A step-by-step approach to documenting tables and columns, leveraging YAML and Markdown files. Learn how to create comprehensive documentation, including tests, with the help of Jinja scripts. Enhance collaboration, improve understanding, and ensure the clarity of your data models with dbt's powerful documentation capabilities
Set up activities in Azure Data Factory to track when your containerized dbt project has finished running
Stay on top of errors that may come up when running your dbt build
A sample Dockerfile to containerize a dbt project with Azure Synapse dependencies
A compilation of a few commands when developing new models in dbt
A sample Dockerfile to containerize a dbt project with Azure Synapse dependencies
A brief description of features from CSV Rainbow and dbt Power User, both extensions for VS Code
Use custom macros to remove deprecated tables and views from your target schema
A summary of three ideas on pausing dbt resources: leveraging the exclusion flag, disabling resources, and transforming models into analyses
A custom macro to create date dimensions when using the dbt-synapse adapter
A brief introduction to exposures and why they should be in every dbt project
A brief introduction to project variables and their benefit when modeling in dbt
A work-around to the default settings when creating tables using the dbt-synapse adapter
Three scenarios show us the implications of running a Directed Acyclic Graph (DAG) in multi-threaded dbt environments, shedding light on how errors and the "fail fast" argument impact model execution
Learn how to troubleshoot and resolve the "Hash Distributed Table" error when taking snapshots using dbt in Azure Synapse Dedicated Pool
Three scenarios show us what happens downstream when our models and/or tests do not succeed
The following are a few points I have learned while working with seeds in my dbt project
A step-by-step approach to documenting tables and columns, leveraging YAML and Markdown files. Learn how to create comprehensive documentation, including tests, with the help of Jinja scripts. Enhance collaboration, improve understanding, and ensure the clarity of your data models with dbt's powerful documentation capabilities
Set up activities in Azure Data Factory to track when your containerized dbt project has finished running
An example of how to run scripts within an Azure Data Factory pipeline
An example of how to connect Azure Batch and Azure Storage accounts to Azure Data Factory
Testing expressions is a simple trick in Azure Data Factory that lets you see what is being passed on to your conditional activities.
A short example of how to use Google Fonts within ggplot graphs
And thus starting your CI/CD pipeline
A sample of questions about GCP used as a study-guide for Google's Associate Cloud Engineer Certification
A few tips on how to avoid surprises at the end of the month
An example of how to scale and one-hot encode variables while preprocessing your data frame.
A script to create sample pandas data frames with randomly generated data.
Some ideas on new features that can be created based on date features
Chances are, you will be needing scalers to make predictions on new data sets
Leveraging R and OpenStreetMap to create custom minimalist posters
A mapping from Austrian 4-digit postal codes to Austrian 3-digit district codes
Some tips to find the right features and tags for the elements you are mapping in R
A mapping from Austrian 4-digit postal codes to Austrian 3-digit district codes
All you need is the GeoJSON file of the map you would like to plot on
I saw multiple people struggling with this topic on Stack Overflow. Here is a simple and quick way of importing Excel data located in a personal OneDrive into Python
In PBI, you can nudge the user towards selecting the data they need first, before it is actually displayed in the report
Highlights in Power BI are a great presentation tool which can unfortunately act oddly when exporting to PDF
And make them stand out from all other datasets in your account
Saving you and your team priceless time when creating reports for your organization
Keep your logo and page layout standard across your organization
Save time and make your Power BI reports look professional by creating custom themes.
Give your stakeholders the ability to know precisely when your data was last refreshed
A common but not intuitive task.
A brief description of the three main services offered by Power BI
I scrape data using Python to find new countries with surf spots where you can work remotely, ceteris paribus
Stay on top of errors that may come up when running your dbt build
An example of how to scale and one-hot encode variables while preprocessing your data frame.
A script to create sample pandas data frames with randomly generated data.
gender-guesser is a Python package that allows you to match a first name and name origin to a gender
Some ideas on new features that can be created based on date features
Chances are, you will be needing scalers to make predictions on new data sets
I saw multiple people struggling with this topic on Stack Overflow. Here is a simple and quick way of importing Excel data located in a personal OneDrive into Python
Leveraging R and OpenStreetMap to create custom minimalist posters
Some tips to find the right features and tags for the elements you are mapping in R
A short example of how to use Google Fonts within ggplot graphs
Selected quotes from what I considered the best books I read/listened to in 2023.
Selected quotes from what I considered the best books I read/listened to in 2022.
Selected quotes from what I considered the best books I read/listened to in 2021.
A few thoughts on what I considered the best books I read/listened to in 2020.
I scrape data using Python to find new countries with surf spots where you can work remotely, ceteris paribus
Some lessons from my travels on the nomad island
An account of unexpected challenges and forging unlikely friendships during the grueling trail ultramarathon across Gran Canaria
Berlin Marathon results among runners from 1974 to 2023, including data from Kaggle for the years 1974-2019 and additional results from 2021-2023
The story of a rollercoaster ride dealing with foot inflammation and my journey back to running
My experience running the Berlin Wall, my first 100-mile foot race
Running alongside the Mauerweg on a cold November day
I insist, the color of the trail marker has nothing to do with the trail's difficulty!
Discovering the power of alcohol-free beer on a warm July 4th
My experience following Eric Orton’s running plan, the Foundation Program
Use custom macros to remove deprecated tables and views from your target schema
A custom macro to create date dimensions when using the dbt-synapse adapter
A work-around to the default settings when creating tables using the dbt-synapse adapter
Learn how to troubleshoot and resolve the "Hash Distributed Table" error when taking snapshots using dbt in Azure Synapse Dedicated Pool
A brief description of features from CSV Rainbow and dbt Power User, both extensions for VS Code