Posts
All the articles I've posted.
The Metadata Structure of Modern Table Formats
Published: at 12:01 PMIceberg uses a metadata tree, Delta Lake uses a transaction log, Hudi uses a timeline. Here is exactly how each format organizes metadata and why it matters.
Row vs. Column: How Storage Layout Shapes Everything
Published: at 12:01 PMRow stores keep records together for fast transactions. Column stores keep field values together for fast analytics. Here is how each layout works and when to use it.
What Are Table Formats and Why Were They Needed?
Published: at 12:00 PMTable formats like Apache Iceberg solved the ACID, schema, and performance problems that turned data lakes into data swamps. Here is how each one works.