Archive
Archive
2025
- January 7 - OLake Architecture, How did we do it?
- March 18 - JSON vs BSON vs JSONB - A Detailed Comparison
- March 18 - Data lake vs Delta Lake
- March 18 - What Are Binlogs?
- April 22 - A Deep Dive into OLake Architecture and Inner Workings
- April 23 - How to Set Up PostgreSQL CDC on AWS RDS - A Step-by-Step Guide
- April 30 - Running OLake on Kubernetes with Your Existing Airflow, Sync Your Data Effortlessly
- May 7 - What makes OLake fast?
- May 8 - Running OLake on EC2 with Your Existing Airflow, Sync Your Data Effortlessly
- July 29 - Building Modern Lakehouse with Iceberg, OLake, Lakekeeper & Trino
- July 29 - OLake Ingestion Filters Explained: SQL‑style WHERE‑Clause Support for Postgres, MySQL & MongoDB
- July 31 - Apache Iceberg vs Delta Lake: Comparison for Batch Analytics & ML Pipelines
- October 1 - Building a Complete Open Data Lakehouse from Scratch with OLake, PrestoDB and MinIO
2024
- September 1 - How to Set Up PostgreSQL CDC on AWS RDS - A Step-by-Step Guide
- September 16 - Four Critical MongoDB ETL Challenges and How to tackle them for your Data Lake and Data Warehouse?
- September 24 - How to Query Semi-Structured JSON Data in Snowflake?
- October 10 - 7 Proven Techniques for Handling Changing Data Type during Semi-Structured Data Ingestion a.k.a Polymorphic Keys
- October 18 - How to Flatten Object Types and Query Arrays in Semi-Structured Nested JSON for Effective Data Extraction
- November 5 - MongoDB Synchronization Strategies Explained - Pros, Cons, and Practical Tips
- November 11 - MongoDB CDC using Debezium and Kafka
- November 21 - Common Challenges Using Debezium and Kafka Connect for CDC
- November 22 - Problems with Debezium and How we (OLake, Open-Source) solve it?