What is OLake Fusion?β
OLake Fusion is an automated Iceberg table maintenance platform that keeps your lakehouse tables efficient, compact, and query-ready as data continuously grows. As Iceberg tables evolve through ingestion, updates, and deletes, they accumulate small files, delete files, and excess metadata β all of which degrade query performance and increase storage overhead over time.
OLake Fusion takes care of all of this, ensuring your destination Iceberg tables stay maintained, performant, and efficiently queriable as they scale.
When is Table Maintenance Required?β
In Apache Iceberg, table maintenance should be performed periodically to ensure consistent query performance and efficient storage as data evolves.
You should consider running maintenance in the following scenarios:
-
Frequent Data Ingestion or Updates
-
Accumulation of Small Files
-
Presence of Delete Files
-
High Partition Cardinality
-
Degrading Query Performance
-
Growing Table Size Over Time
Regular maintenance ensures that Iceberg tables remain compacted, scalable, and performant for analytical workloads.
Iceberg Maintenance is available starting from v0.4.0. Upgrade OLake UI to access the Maintenance module.
- Existing users (Docker): If you are already using OLake Go for Ingestion, follow the upgrade guide to access the Maintenance module.
- Existing users (Helm / Kubernetes): If you are running OLake Go on Kubernetes, follow the chart upgrade guide to access the Maintenance module.
- New Users (Docker): Follow the quickstart guide to get started.
- New Users (Helm / Kubernetes): Follow the quickstart guide to get started.