Blogs on OLake
Deep Dive into Kafka as a Source in OLake: Unpacking Sync, Concurrency, and Partition Mastery
Explore OLake's Kafka source connector—featuring schema discovery, custom group balancing, partition-aware concurrency, and incremental batch sync to Apache Iceberg with exactly-once semantics.
Postgres → Iceberg → Doris: A Smooth Lakehouse Journey Powered by Olake
Learn how to build a complete lakehouse architecture using PostgreSQL, Apache Iceberg, and Apache Doris for real-time analytics. Step-by-step guide with OLake for seamless data ingestion.
Building a Serverless Iceberg Lakehouse: OLake's Speed + Bauplan's Git Workflows
Learn how OLake and Bauplan work together to create a powerful, version-controlled data lakehouse on Apache Iceberg.
Parquet vs. Iceberg: From File Format to Data Lakehouse King
Understand how Apache Parquet and Apache Iceberg complement each other — the foundation and blueprint for building reliable, scalable data lakehouses.
7× Faster Iceberg Writes: How We Rebuilt OLake's Destination Pipeline
Technical deep dive into our destination refactor: exactly-once visible state, atomic commits, and a 7× throughput boost.
Building a Scalable Lakehouse with Iceberg, Trino, OLake & Apache Polaris
Learn how OLake, Iceberg, Lakekeeper, and Trino create a scalable, secure, and real-time modern data lakehouse architecture for analytics.












