Skip to main content

The Blog

Thoughts on data engineering, software, and the occasional rabbit hole.

Featured

Why I Created a GitHub Organisation (And Why You Probably Should Too)

5 min read

I've had a personal GitHub account for years. It does what it needs to do — it holds my repos, tracks my contributions, and occasionally embarrasses me with commits from 2018 that I'd rather not think about.

6 min read

AI Agents: The Good, the Frustrating, and the Genuinely Useful

Full transparency upfront: this post was written by a GitHub Copilot agent on my behalf. I gave it the brief, it wrote the words. My thoughts, its keyboard. Make of that what you will.

4 min read

How I Used an AI Agent to Modernize My Portfolio

I recently handed the keys of my portfolio site over to a GitHub Copilot coding agent and asked it to modernize the whole thing — version upgrades, UI polish, dependency management, the lot. Here's what that actually looked like in practice.

6 min read

Dimensional Modeling 101 - Design Better Data Warehouses

If you're building a data warehouse, dimensional modeling is your best friend. It makes data easier to query, understand, and analyze. Let's break down this powerful technique!

6 min read

Data Quality Validation - Ensuring Your Data is Trustworthy

Bad data leads to bad decisions. As data engineers, one of our most important jobs is ensuring data quality. Let's explore how to validate and maintain high-quality data!

5 min read

Getting Started with Apache Airflow - Orchestrate Your Data Pipelines

Apache Airflow has become the go-to tool for orchestrating data workflows. If you've ever needed to run tasks in a specific order, on a schedule, with dependencies - Airflow is your friend!

4 min read

How to Choose the Right Database for Your Data Project

Choosing a database feels a bit like choosing a car - there's no single "best" option, only the best option for your specific needs. Let's navigate this together!

3 min read

Introduction to Data Pipelines - Your First Step in Data Engineering

Hey there! If you're stepping into the world of data engineering, you've probably heard the term "data pipeline" thrown around quite a bit. Let's break down what they are and why they're so important.

1 min read

Welcome

Welcome to my new blog