Category Trends

Why Iceberg

Iceberg has gained a huge amount of popularity in recent years, but why is this table format now finding such widespread adoption? There are a number of reasons and I shall attempt to explain them in this post. Before Iceberg,…

The Future of Data Engineering

Is the answer AI? Ummm….not yet (correct at time of writing). Data engineering today looks remarkably different from five years ago. The role that emerged to build Hadoop clusters and write MapReduce jobs has evolved into something unrecognizable from its…

REST vs GraphQL

The question of REST versus GraphQL has become one of those debates in software engineering where everyone has an opinion, most of it strong (if sometimes wrong), and much of it colored by their most recent project experience. Like most…

Data Models 1970

Data is not just the new oil; it’s the new language of decision-making. But before machine learning, embeddings, or even NoSQL, data had to be structured, described, and designed. Understanding how data models evolved, from Edgar Codd, Peter Chen and…

Data Modeling for BI – Best Practices

Most business intelligence (BI) initiatives fail not because of bad technology choices or insufficient data, but because the underlying data model doesn’t align with how business users think about their questions. Analysts build elegant dimensional models following textbook principles, but…

What Will Replace SQL?

Nothing. Ok, I should probably write a bit more than that. Nothing replaces SQL outright. Instead, three complementary layers are emerging around SQL. It will likely be with us forever, but it’s interesting to understand what alternatives exist and why.…

Rise and Fall(?) of the Data Warehouse

For nearly three decades, the enterprise data warehouse reigned supreme. It was the single source of truth, the vault where all valuable corporate data lived, cleaned, and neatly structured for consumption. Vendors promised that if you put everything in the…

From Tape to Terabytes

We don’t really think about where the data is physically stored. In the digital world, storage is the unsung hero. Data scientists, AI models and analytics tools often take centre stage, but none of them work without a place to…

Data as a Product

Treating data as a product works when you fuse continuous discovery habits (Teresa Torres: outcomes, opportunity solution trees, assumption testing) with a platform that enforces contracts, SLOs, and governance (catalogs, open table formats, policy-as-code, observability). The result: faster iteration with…

Data Contracts & Collaboration

In modern organisations, data moves across teams, tools, and systems faster than ever before. With this speed comes a persistent risk: the moment a producer changes the shape, meaning, or frequency of data without warning, downstream consumers can experience silent…