2025​
- January 7 - OLake Architecture, How did we do it?
- March 18 - JSON vs BSON vs JSONB - A Detailed Comparison
- March 18 - Data Lake vs Delta Lake: Key Differences and Use Cases for Modern Analytics
- March 18 - What Are Binlogs?
- April 22 - A Deep Dive into OLake Architecture and Inner Workings
- April 23 - How to Set Up PostgreSQL CDC on AWS RDS - A Step-by-Step Guide
- April 30 - Running OLake on Kubernetes with Your Existing Airflow, Sync Your Data Effortlessly
- May 7 - What makes OLake fast?
- May 8 - Running OLake on EC2 with Your Existing Airflow, Sync Your Data Effortlessly
- July 29 - Building Modern Lakehouse with Iceberg, OLake, Lakekeeper & Trino
- July 29 - OLake Ingestion Filters: Explained and Optimized for Faster Data Replication
- July 31 - Apache Iceberg vs Delta Lake: The Ultimate Guide to Data Lakehouse Technologies
- August 12 - Building an Open Data Lakehouse: Integrating OLake, PrestoDB, MinIO, and Apache Iceberg
- August 29 - Deploying OLake on Kubernetes with Helm: Simplifying Lakehouse Management
- September 4 - Comparing Delete Methods in Iceberg and Delta Lake: A Performance Review
- September 4 - Creating and Managing OLake Jobs with Docker CLI: A Practical Guide
- September 7 - How to Set Up PostgreSQL to Apache Iceberg Replication for Real-Time Analytics: Complete Guide
- September 9 - Data Ingestion From MySQL to Apache Iceberg: Optimizing Data Replication for Modern Analytics
- September 10 - Setting Up Ingestion Pipeline from MongoDB to Apache Iceberg: Step-by-Step Guide for Real-Time Analytics
- September 15 - Apache Iceberg vs. Hive: Comprehensive Comparison for Data Lakehouses
2024​
- September 1 - How to Set Up PostgreSQL CDC on AWS RDS - A Step-by-Step Guide
- September 16 - Four Critical MongoDB ETL Challenges and How to tackle them for your Data Lake and Data Warehouse?
- September 24 - How to Query Semi-Structured JSON Data in Snowflake?
- October 10 - 7 Proven Techniques for Handling Changing Data Type during Semi-Structured Data Ingestion a.k.a Polymorphic Keys
- October 18 - How to Flatten Object Types and Query Arrays in Semi-Structured Nested JSON for Effective Data Extraction
- November 5 - MongoDB Synchronization Strategies Explained - Pros, Cons, and Practical Tips
- November 11 - MongoDB CDC using Debezium and Kafka
- November 21 - Common Challenges Using Debezium and Kafka Connect for CDC
- November 22 - Problems with Debezium and How we (OLake, Open-Source) solve it?