Delayed View Semantics

Streaming data systems and relational databases each offer immense value. Streaming excels at delivering low-latency, real-time insights; relational databases give strong consistency, rich transactional guarantees, and mature tooling. Yet, integrating…

B-Tree Alternatives

For decades, the B-tree has been the backbone of database indexing, enabling efficient lookups, inserts, and range queries in block-oriented storage systems. For more information, please see my post on…

API Design – Data Products

The term data product has become ubiquitous in modern data organizations, but its meaning often remains fuzzy. Teams talk about building data products, while creating the same old dashboards, reports,…

Knowledge Graphs

Data is everywhere, but meaning is scarce. Businesses increasingly struggle with silos, databases older than most of the people operating them (hello DB2), inconsistent or missing semantics, and limited discoverability. Knowledge Graphs…

DataFrames. The Wrong Choice?

DataFrames dominate modern data analysis. If I had a £ for every time I typed… import pandas as pd Whether it's Pandas/Polars in Python, Spark DataFrames, or R’s original…