28-04-2025 Release notes
Breaking Changes
catalog.json renamed to streams.json
#246: As part of improved terminology and consistency, OLake now expects stream configuration to be defined in a file named streams.json instead of catalog.json.
⚠️ This is a backward-incompatible change. Existing pipelines or DAGs must update the file name to continue functioning correctly.
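For deployments with many configuration directories, the rename can be scripted. The following is a minimal sketch, not an official OLake tool: the helper name `rename_catalog_files` and the directory layout are assumptions for illustration, and any pipeline or DAG definition that refers to the old file name still has to be updated separately.

```python
from pathlib import Path


def rename_catalog_files(root: Path) -> list[Path]:
    """Rename every catalog.json under root to streams.json.

    Hypothetical helper: returns the new paths of the files that were renamed.
    """
    renamed = []
    # Materialize the matches first so renaming does not disturb the traversal.
    for old in sorted(root.rglob("catalog.json")):
        if not old.is_file():
            continue
        new = old.with_name("streams.json")
        if new.exists():
            # Never overwrite an existing streams.json; leave it for manual review.
            continue
        old.rename(new)
        renamed.append(new)
    return renamed
```

Calling `rename_catalog_files(Path("./configs"))` (an example path) renames the files in place and skips any directory that already contains a streams.json.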
Core OLake Changes
Enhancements & Features
- Hive Catalog Support
  #226: OLake now supports syncing to Apache Iceberg tables using Hive catalog integration.
- Lakekeeper Integration for REST Catalogs
  REST-based catalogs can now be orchestrated with Lakekeeper for better sync lifecycle control.
- Iceberg Partitioning Logic
  #227: Native partitioning support added to Iceberg writer for optimized data layout.
- Fallback ID Generation (olakeID)
  #225: OLake now auto-generates a consistent olakeID if no primary key is present in a record.
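These notes do not describe how the fallback olakeID is computed. As an illustration of what "consistent" can mean here, the sketch below derives a deterministic ID by hashing a canonical serialization of the record. The function name `fallback_id` and the choice of SHA-256 over sorted-key JSON are assumptions, not OLake's actual algorithm.

```python
import hashlib
import json


def fallback_id(record: dict) -> str:
    """Derive a deterministic ID from a record's contents (illustrative only)."""
    # Sort keys so that field order does not change the resulting hash.
    # default=str is a simple fallback for values JSON cannot encode natively.
    canonical = json.dumps(
        record, sort_keys=True, separators=(",", ":"), default=str
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()
```

With this approach the same record always maps to the same ID regardless of field order, so re-syncing a row without a primary key does not produce a new identifier.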
Improvements & Fixes
- Updated to Iceberg Writer v1.7.2
  #252: Switched to Iceberg 1.7.2 to resolve an issue with S3 connections not closing promptly when using the Hive catalog.
- Handle Special Double Values (NaN, Infinity, -Infinity)
  #251: Fixed ingestion failures due to unsupported float values in source data.
  Fixes: #109
- Improved Logging for Iceberg Sync Failures
  #239: Enhanced logging output with clearer and color-coded error messages for debugging sync failures.
- Pre-commit Hook Integration
  #178: Added automated formatting and linting checks via pre-commit hooks to enforce code standards.
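To show why special double values (#251) can break ingestion, the sketch below uses strict JSON encoding, which rejects NaN and the infinities, and sanitizes a row before encoding it. Mapping these values to null is an assumption made for the example; the notes do not say how OLake represents them after the fix, and `sanitize` is a hypothetical helper.

```python
import json
import math


def sanitize(value):
    """Recursively replace NaN and +/-Infinity with None (assumed mapping)."""
    if isinstance(value, float) and not math.isfinite(value):
        return None
    if isinstance(value, dict):
        return {k: sanitize(v) for k, v in value.items()}
    if isinstance(value, list):
        return [sanitize(v) for v in value]
    return value


row = {"price": float("nan"), "ratio": float("-inf"), "qty": 3}

# allow_nan=False makes json.dumps raise ValueError on non-finite floats,
# which mirrors a strict downstream consumer.
encoded = json.dumps(sanitize(row), allow_nan=False)
```

Without the `sanitize` call, `json.dumps(row, allow_nan=False)` raises `ValueError`.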
Connector Releases
Postgres Connector - v0.0.4
- Resolved data type issues: type mapping is now more consistent and accurate, and destination column data types match those of the source database.
MongoDB Connector - v0.0.10
- Added support for primitive data types: #247
- Improved performance when syncing high-volume collections.