Category Cloud

Serverless Databases

The term serverless is somewhat misleading because servers obviously exist somewhere. This is not magic. For decades, deploying a database meant answering impossible questions. How much capacity do you need? How many CPU cores, how much memory, how much disk?…

Snowflake’s Indexing Methods

I have various posts on this site that talk to indexes. But Snowflake’s indexing and search approach is quite different from traditional approaches like B-trees. It doesn’t maintain B-tree indexes like row-based relational databases, like MySQL or PostgreSQL. No User-Created…

Delayed View Semantics

Streaming data systems and relational databases each offer immense value. Streaming excels at delivering low-latency, real-time insights; relational databases give strong consistency, rich transactional guarantees, and mature tooling. Yet, integrating these worlds without sacrificing usability or correctness has long been…

Databricks Indexing

Databricks (like Snowflake) doesn’t rely on traditional B-trees, because it’s built on a cloud-native, columnar, distributed file architecture. It avoids B-trees entirely because the cost of maintaining per-row index structures would destroy the scalability benefits of its append-only, distributed Parquet…