Skip to main content

9 posts tagged with "Apache Iceberg"

Blogs on the topic Apache Iceberg

View All Tags
Apache Iceberg vs. Hive: Comprehensive Comparison for Data Lakehouses
32 min

Apache Iceberg vs. Hive: Comprehensive Comparison for Data Lakehouses

Apache Hive and Apache Iceberg represent two different generations of the data lake ecosystem. Hive was born in the Hadoop era as a SQL abstraction over HDFS, excelling in batch ETL workloads and still valuable for organizations with large Hadoop/ORC footprints. Iceberg, by contrast, emerged in the cloud-native era as an open table format designed for multi-engine interoperability, schema evolution, and features like time travel. If you are running a legacy Hadoop stack with minimal need for engine diversity, Hive remains a practical choice. If you want a flexible, future-proof data lakehouse that supports diverse engines, reliable transactions, and governance at scale, Iceberg is the more strategic investment.

Akshay Kumar Sharma
Akshay Kumar Sharma
Sep 15, 2025
Setting Up Ingestion Pipeline from MongoDB to Apache Iceberg: Step-by-Step Guide for Real-Time Analytics
15 min

Setting Up Ingestion Pipeline from MongoDB to Apache Iceberg: Step-by-Step Guide for Real-Time Analytics

MongoDB has become the go-to database for modern applications, handling everything from user profiles to IoT sensor data with its flexible document model. But when it comes to analytics at scale, MongoDB's document-oriented architecture faces significant challenges with complex queries, aggregations, and large-scale data processing.

Rohan Khameshra
Rohan Khameshra
Sep 10, 2025