Category Trends

Quantum & Data Security – Oh Sh#t!

The encryption protecting your data today relies on mathematical problems that are practically impossible for classical computers to solve. Factoring large numbers, computing discrete logarithms, and similar hard problems form the foundation of RSA, ECC, and Diffie-Hellman, the cryptographic algorithms…

The Persistence of Microsoft Access

It first appeared in 1992, bundled with Microsoft Office, promising a way for anyone to build a database without writing a line of SQL. Three decades later, Microsoft Access is still here, quietly running in the background of finance departments,…

B-Tree Alternatives

For decades, the B-tree has been the backbone of database indexing, enabling efficient lookups, inserts, and range queries in block-oriented storage systems. For more information, please see my post on B-Trees: Its balanced structure and high fanout made it the…

Build a Data Marketplace

We all understand the concept of a marketplace. But it’s usually for things we spend money on to acquire. In today’s digital economy, data also has value. Real, tangible value realised through data products, analytics and whatever AI is capable…

TDD for data. Is this a thing?

Test-Driven Development (TDD) has been a staple of software engineering for decades. It was first proposed by Kent Beck in the late 1990s and early 2000s. He popularized TDD as part of Extreme Programming (XP) and wrote the influential book…

Which DataFrame?

In modern data architecture, clarifying the distinction between file formats (like Avro, Parquet) and table formats (like Iceberg, Delta Lake), as well as interoperability layers (XTable), is critical. Your choice guides performance, mutability, governance, and platform flexibility. This post gives…