Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124

Iceberg has gained a huge amount of popularity in recent years, but why is this table format now finding such widespread adoption? There are a number of reasons and I shall attempt to explain them in this post. Before Iceberg,…

Is the answer AI? Ummm….not yet (correct at time of writing). Data engineering today looks remarkably different from five years ago. The role that emerged to build Hadoop clusters and write MapReduce jobs has evolved into something unrecognizable from its…

The question of REST versus GraphQL has become one of those debates in software engineering where everyone has an opinion, most of it strong (if sometimes wrong), and much of it colored by their most recent project experience. Like most…

Data is not just the new oil; it’s the new language of decision-making. But before machine learning, embeddings, or even NoSQL, data had to be structured, described, and designed. Understanding how data models evolved, from Edgar Codd, Peter Chen and…

Most business intelligence (BI) initiatives fail not because of bad technology choices or insufficient data, but because the underlying data model doesn’t align with how business users think about their questions. Analysts build elegant dimensional models following textbook principles, but…

Nothing. Ok, I should probably write a bit more than that. Nothing replaces SQL outright. Instead, three complementary layers are emerging around SQL. It will likely be with us forever, but it’s interesting to understand what alternatives exist and why.…

For nearly three decades, the enterprise data warehouse reigned supreme. It was the single source of truth, the vault where all valuable corporate data lived, cleaned, and neatly structured for consumption. Vendors promised that if you put everything in the…

We don’t really think about where the data is physically stored. In the digital world, storage is the unsung hero. Data scientists, AI models and analytics tools often take centre stage, but none of them work without a place to…

Treating data as a product works when you fuse continuous discovery habits (Teresa Torres: outcomes, opportunity solution trees, assumption testing) with a platform that enforces contracts, SLOs, and governance (catalogs, open table formats, policy-as-code, observability). The result: faster iteration with…

In modern organisations, data moves across teams, tools, and systems faster than ever before. With this speed comes a persistent risk: the moment a producer changes the shape, meaning, or frequency of data without warning, downstream consumers can experience silent…
You must be logged in to post a comment.