Posts
All the articles I've posted.
- 4 MIN READ•Mar 7, 2026
What is Apache Parquet? Columns, Encoding, and Performance
If you ask a data analyst to calculate the average transaction amount for the month of July using a massive CSV file, the compute engine must read eve...
data lakehouse, apache iceberg, apache parquet
- 21 MIN READ•Mar 7, 2026
What is Apache Polaris? Unifying the Iceberg Ecosystem
Treating thousands of Parquet files as a unified database table requires a brain. Apache Iceberg provides the metadata structure to do this, but the Iceberg specification alone does not manage security roles, handle network requests, or broker credentials. You need an open catalog service to orchestrate those root metadata pointers. Apache Polaris serves as that open-source, vendor-neutral brain. This comprehensive guide explains the catalog fragmentation war, open governance under the Apache Software Foundation, role-based access control hierarchies, credential vending vs IAM sprawl, and how Polaris powers Dremio's agentic query acceleration.
data lakehouse, apache iceberg, apache parquet
- 6 MIN READ•Mar 7, 2026
Apache Software Foundation: History, Purpose, and Process
If you build a modern data lakehouse, you inevitably stack Apache Iceberg, Apache Parquet, and Apache Arrow. These projects dictate how you store, que...
data lakehouse, apache iceberg, apache parquet