Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124

Every generation of data tooling has its keystone. In the 1980s, relational databases defined the foundation. In the 2000s, Hadoop represented a seismic shift in scale. Today, in the cloud-first…

The AI boom has brought a new class of databases into the spotlight: vector databases. But in the background triple stores have been quietly powering knowledge graphs and the Semantic…

I always find joy in understanding concepts at a deep and fundamental level. This stems from a love of mathematics and the mental journeys you can embark on. Being able…

While the star schema dominates modern data warehousing, there’s another dimensional modeling pattern that occasionally enters the conversation: the snowflake schema. Named for its branching, crystalline structure when diagrammed, this…

When you’re searching petabytes of data, you can’t afford to scan every record or even every file. You need a quick way to answer the question: “Is this key possibly…

In today’s era of big data, organizations face the challenge of processing vast datasets with varying timeliness and accuracy requirements. The Lambda Architecture, introduced by Nathan Marz, offers a solution…

Change Data Capture (CDC) is one of those concepts that seems deceptively simple: Just capture the changes But it soon becomes complex once you start implementing at scale. In today’s…

For over a decade (I had to check this as it made me feel old) Pandas has been the go-to Python library for data analysis. Its DataFrame API has shaped…

In database systems, ACID – Atomicity, Consistency, Isolation and Durability is the cornerstone of transactional reliability. It guarantees that operations are safe, corruption-free, and recoverable. Yet, as data storage models…

The last five years have seen a number of open data table formats vying for position, these include Apache Iceberg, Delta Lake and Apache Hudi. By mid-2025, the winner is…

In data modeling, as in life, the answer to many questions is “it depends.” Should you denormalize your dimensions into simple star schemas or normalize them into snowflake schemas? The…

In other words, Data Mesh. Yes, another technology buzz word to add to the collection? But this one is different. The biggest challenge in building a data mesh is not…
You must be logged in to post a comment.