The fastest query is the one you don’t need to make. Zone maps are a lightweight indexing technique that lets the query engine skip large chunks of data by storing summary statistics for each data block. They’re simple, space-efficient, and central to performance in modern columnar and cloud-native databases.
A zone map is a metadata structure that stores minimum and maximum values (and sometimes other stats) for each block of data.
- min_value for the block
- max_value for the block

Example: For a block of rows in a sale_date column:

- min_date = 2025-07-01
- max_date = 2025-07-15

If a query asks for sale_date = '2025-08-01', the engine knows immediately that this block cannot contain matching rows and skips it.
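The skip decision described here can be sketched in a few lines of Python. This is a minimal illustration, not how any particular engine implements it: the names `ZoneMapEntry`, `build_zone_map`, and `may_contain`, the fixed block size, and the sample data are all assumptions made for the example.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class ZoneMapEntry:
    block_id: int
    min_value: date
    max_value: date


def build_zone_map(values, block_size):
    """Split values into fixed-size blocks and record each block's min and max."""
    entries = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        entries.append(ZoneMapEntry(start // block_size, min(block), max(block)))
    return entries


def may_contain(entry, target):
    """False means the block certainly has no match; True means it must be read."""
    return entry.min_value <= target <= entry.max_value


# Hypothetical data: one sale per day, 2025-07-01 through 2025-07-30.
sale_dates = [date(2025, 7, day) for day in range(1, 31)]
zone_map = build_zone_map(sale_dates, block_size=15)

# Block 0 covers 2025-07-01..2025-07-15, block 1 covers 2025-07-16..2025-07-30.
target = date(2025, 8, 1)
blocks_to_read = [e.block_id for e in zone_map if may_contain(e, target)]
print(blocks_to_read)  # [] because no block's range includes 2025-08-01
```

Note that the check is one-sided: a `False` result proves the block has no match, while a `True` result only means the block has to be scanned to find out.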
Parquet file with 3 row groups:
| Row Group | Min order_date | Max order_date |
|---|---|---|
| RG1 | 2025-07-01 | 2025-07-10 |
| RG2 | 2025-07-11 | 2025-07-20 |
| RG3 | 2025-07-21 | 2025-07-31 |
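
SELECT * FROM orders WHERE order_date = '2025-07-15';

Zone map pruning:

- RG1: skipped (max 2025-07-10 is before 2025-07-15)
- RG2: read (2025-07-15 falls between 2025-07-11 and 2025-07-20)
- RG3: skipped (min 2025-07-21 is after 2025-07-15)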
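
The same pruning logic extends from equality to range predicates by testing whether the query range overlaps each row group's min/max range. The sketch below hard-codes the statistics from the table above rather than reading them from a real Parquet file, and the `prune` helper is a hypothetical name used only for illustration.

```python
from datetime import date

# (name, min order_date, max order_date), copied from the table above.
row_groups = [
    ("RG1", date(2025, 7, 1), date(2025, 7, 10)),
    ("RG2", date(2025, 7, 11), date(2025, 7, 20)),
    ("RG3", date(2025, 7, 21), date(2025, 7, 31)),
]


def prune(row_groups, lo, hi):
    """Return the names of row groups whose [min, max] range overlaps [lo, hi]."""
    return [name for name, mn, mx in row_groups if mn <= hi and mx >= lo]


# Equality predicate: order_date = '2025-07-15'
target = date(2025, 7, 15)
print(prune(row_groups, target, target))  # ['RG2']

# Range predicate: order_date BETWEEN '2025-07-08' AND '2025-07-12'
print(prune(row_groups, date(2025, 7, 8), date(2025, 7, 12)))  # ['RG1', 'RG2']
```

For the equality query, only one of the three row groups survives, so roughly two thirds of the file is never read.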
Zone maps are the first line of defense against unnecessary I/O in modern analytics engines.
They’re simple but powerful — acting as a quick pre-filter before heavier indexes or scans kick in.
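Their effectiveness depends on data layout: if values are scattered randomly across blocks, every block's min/max range is wide and few blocks can be skipped. When combined with clustering (like Z-ordering), which narrows those ranges, and Bloom filters, zone maps can turn multi-minute full scans into sub-second queries on terabytes of data.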