OLake (v0.3.17 - v0.3.18)
February 10, 2026 β February 20, 2026
π― What's Newβ
Sourcesβ
-
Kafka Avro schema normalization -
Avro decoding could fail when the schemaname/namespacecontained invalid characters (for example,-). Fixed this by replacing unsupported characters with_so schemas follow Avro naming conventions and can be decoded. -
MSSQL connector integration tests -
Added integration tests for the MSSQL driver covering full refresh, incremental, and CDC sync to Iceberg and Parquet, including datatype coverage plus CRUD and schema evolution cases. -
Schema discovery parallelism -
Added--max-discover-threadsto control parallel stream discovery at runtime (default is 50). If you have many streams, you can increase this value to speed up discovery.
Catalogsβ
- Glue catalog custom endpoint and credentials -
Added optionalglue_*configs (endpoint, creds, catalog ID) so Glue catalog access can use separate credentials/endpoint from S3, while keeping default behavior unchanged when not set.
Destinationsβ
- Lakekeeper catalog image tag to v0.11.1 -
Replaced thelatestLakekeeper/catalog image tag withv0.11.1.
π§ Bug Fixes & Stabilityβ
-
Parquet schema extra column fix for normalization -
With normalization enabled, Parquet output incorrectly included an extra data column. Fixed this by initializing default OLake columns upfront and only add data when normalization is disabled. -
Primary key order fix for chunk processing (MSSQL, DB2) -
Fixed non-deterministic composite primary key ordering during backfill by sorting primary key columns inChunkIterator, ensuring consistent chunk comparisons and scanning. -
MySQL timezone offset parsing fix -
Fixed MySQL timezone offsets not being parsed and falling back to UTC by correctly parsing offsets and using them for timezone conversion, so CDC/full refresh use the right timezone β -
Performance test CI destination config fix -
Updated performance test CI to generateiceberg_destination.json(instead ofdestination.json), matching the path expected by the test utilities config. -
MSSQL physloc chunk encoding fix -
Hex-encoded%%physloc%%chunk boundary values to prevent invalid UTFβ8 in gRPC/protobuf strings, with a fallback to re-encode older state chunks for compatibility.