Skip to content
Core Concepts Last updated: May 14, 2026

Apache Iceberg Spec v4 (Current State)

Apache Iceberg Spec v4 is in early community discussion and proposal stages as of 2025, with potential features including extended geospatial capabilities, enhanced variant type operations, improved row lineage, catalog-level transaction semantics, and multi-table ACID operations — building on the Spec v3 foundation.

iceberg spec v4iceberg format version 4iceberg v4 roadmapiceberg future featuresiceberg community roadmap

Apache Iceberg Spec v4: Current State

Apache Iceberg Spec v4 represents the next horizon of the Iceberg format specification, with early discussions, proposals, and design explorations happening in the Apache Iceberg community in 2025. As of now, Spec v4 is not formally released or finalized — it exists in the form of design documents, GitHub issues, community discussions, and early-stage proposals (IEPs — Iceberg Enhancement Proposals).

Understanding what’s being discussed for Spec v4 is valuable for architects planning long-term lakehouse strategies and for teams tracking where the Iceberg ecosystem is heading.

Important context: Spec v3 itself is still being broadly adopted across engines as of 2025. Spec v4 features are exploratory and subject to significant change. Follow the Apache Iceberg GitHub and the community mailing lists for the most current status.

What Spec v3 Established (The Foundation for v4)

Before discussing v4, it’s important to understand what Spec v3 delivered:

Spec v4 discussions start from this foundation and address capabilities that v3 didn’t fully solve.

Areas Being Discussed for Spec v4

1. Multi-Table Transactions

One of the most requested capabilities in the Iceberg community: atomic commits across multiple tables. Currently, each Iceberg table has its own isolated snapshot — there’s no mechanism to atomically commit to tables A, B, and C simultaneously with ACID guarantees.

Proposed approaches:

This would enable operations like “atomically move data from staging.orders to prod.orders and update prod.order_summary in one transaction.”

2. Enhanced Variant Type Operations

Spec v3’s Variant type provides storage; v4 discussions explore:

3. Extended Geospatial Capabilities

Building on v3’s geometry types:

4. Improved Row Lineage

Spec v3 introduces row lineage tracking; v4 may formalize:

5. Catalog-Level Extensions to the Spec

Discussion around whether catalog behaviors (multi-catalog transactions, cross-catalog references) should be formalized in the Iceberg spec rather than being catalog-specific:

6. Compute Statistics Integration

While Puffin already provides the blob format for statistics, v4 may formalize:

How to Track Spec v4 Progress

The Iceberg community uses several channels for spec evolution:

The Spec Evolution Philosophy

A key principle of the Iceberg spec governance:

  1. Backward compatibility: Every new spec version must be able to read all previous versions.
  2. Engine negotiation: Engines declare which spec version they support; tables can only be upgraded to versions all reading engines support.
  3. Community consensus: Spec changes require broad community agreement — no single vendor controls the spec direction.

This governance model is what makes Iceberg uniquely trustworthy for long-term architecture decisions: no single vendor can unilaterally break compatibility or change the spec in ways that disadvantage others.

Planning for Spec v4

For architects today:

  1. Adopt Spec v2 immediately: The current production standard for all new tables.
  2. Plan for Spec v3: Evaluate deletion vectors and Variant type for tables where these features provide clear value. Expect broad engine support in 2025–2026.
  3. Monitor Spec v4: Follow community discussions for multi-table transaction proposals — this will be transformative when it arrives.
  4. Don’t over-anticipate: Build on what’s stable (v2) and tested (v3 in progress) rather than waiting for v4 features that don’t have committed timelines.

📚 Go Deeper on Apache Iceberg

Alex Merced has authored three hands-on books covering Apache Iceberg, the Agentic Lakehouse, and modern data architecture. Pick up a copy to master the full ecosystem.

← Back to Iceberg Knowledge Base