Starburst Enterprise SEP 414-E+

End-to-end Iceberg analytics platform with comprehensive catalog support, full DML operations, enterprise governance, and advanced optimization features

Key Features

100
Full Integration

Comprehensive Catalog Support

Hive Metastore, AWS Glue, JDBC, REST, Nessie, Snowflake, and Starburst Galaxy managed metastore with flexible configuration via iceberg.catalog.type
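As a rough illustration of how the catalog type is selected, the sketch below renders the text of a catalog properties file. The helper name `render_catalog_properties` and the endpoint URL are hypothetical. The type values are the ones the open-source Trino Iceberg connector accepts for `iceberg.catalog.type`, and the Starburst Galaxy managed metastore is left out because its setting is not given here.

```python
# Catalog types named in the text, written as iceberg.catalog.type values.
# The Galaxy managed metastore is omitted: its setting is not stated here.
CATALOG_TYPES = {"hive_metastore", "glue", "jdbc", "rest", "nessie", "snowflake"}


def render_catalog_properties(catalog_type, extra=None):
    """Return the text of a catalog properties file for the Iceberg connector."""
    if catalog_type not in CATALOG_TYPES:
        raise ValueError(f"unsupported catalog type: {catalog_type}")
    props = {"connector.name": "iceberg", "iceberg.catalog.type": catalog_type}
    props.update(extra or {})
    return "\n".join(f"{key}={value}" for key, value in props.items()) + "\n"


# Hypothetical REST catalog endpoint, for illustration only.
rest_properties = render_catalog_properties(
    "rest", {"iceberg.rest-catalog.uri": "https://catalog.example.com"}
)
print(rest_properties)
```

Each catalog type needs its own additional connection properties (metastore URI, credentials, warehouse location), which this sketch passes through unchanged via `extra`.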

100
Full Support

Complete DML Operations

INSERT, UPDATE, DELETE, MERGE all supported with intelligent partition-aligned predicates and Iceberg v2 position/equality-delete files
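To make the MERGE case concrete, here is a minimal sketch that builds an upsert statement in the Trino SQL dialect. The helper `build_merge` and the table and column names are hypothetical, and identifiers are interpolated without quoting, so it assumes trusted input.

```python
def build_merge(target, source, key, columns):
    """Build a MERGE that updates matching rows and inserts the rest.

    Identifiers are inserted as-is, so callers must pass trusted names.
    """
    assignments = ", ".join(f"{col} = s.{col}" for col in columns)
    all_columns = [key, *columns]
    insert_cols = ", ".join(all_columns)
    insert_vals = ", ".join(f"s.{col}" for col in all_columns)
    return (
        f"MERGE INTO {target} AS t\n"
        f"USING {source} AS s\n"
        f"ON t.{key} = s.{key}\n"
        f"WHEN MATCHED THEN UPDATE SET {assignments}\n"
        f"WHEN NOT MATCHED THEN INSERT ({insert_cols}) VALUES ({insert_vals})"
    )


# Hypothetical catalog, schema, and table names.
merge_sql = build_merge(
    "lake.sales.orders", "lake.staging.order_updates", "order_id", ["status", "amount"]
)
print(merge_sql)
```

The generated statement can be submitted through any SQL client connected to the cluster.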

95
Built-in Access Control

Enterprise Security & Governance

Built-in access-control engine with table/column-level ACLs, LDAP/OAuth integration, and support for Lake Formation and HMS Ranger policies

100
SQL Syntax

Advanced Time Travel

Query past snapshots using FOR VERSION AS OF or FOR TIMESTAMP AS OF with metadata tables ($snapshots, $history, $manifests) and maintenance procedures
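The two time-travel clauses and the metadata tables can be sketched as small query builders. The helper names and the `lake.sales.orders` table are hypothetical. The one detail worth noting is that a metadata table name contains `$`, so it has to be double-quoted.

```python
def metadata_table(catalog_schema, table, name):
    """Return a reference such as lake.sales."orders$snapshots"."""
    return f'{catalog_schema}."{table}${name}"'


def time_travel_query(table, snapshot_id=None, timestamp=None):
    """Build a SELECT that reads the table as of a snapshot ID or a timestamp."""
    if (snapshot_id is None) == (timestamp is None):
        raise ValueError("pass exactly one of snapshot_id or timestamp")
    if snapshot_id is not None:
        return f"SELECT * FROM {table} FOR VERSION AS OF {int(snapshot_id)}"
    return f"SELECT * FROM {table} FOR TIMESTAMP AS OF TIMESTAMP '{timestamp}'"


# List snapshots first, then read the table at one of them.
snapshots_sql = (
    "SELECT snapshot_id, committed_at FROM "
    + metadata_table("lake.sales", "orders", "snapshots")
    + " ORDER BY committed_at DESC"
)
by_version = time_travel_query("lake.sales.orders", snapshot_id=8954597067493422955)
by_time = time_travel_query("lake.sales.orders", timestamp="2024-01-15 00:00:00 UTC")
print(snapshots_sql, by_version, by_time, sep="\n")
```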

100
MoR & CoW

Adaptive Storage Strategies

Copy-on-write by default for large rewrites; fine-grained updates write separate delete files (merge-on-read) that are merged at query time. Handles both position and equality deletes.
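The idea of merging delete files at query time can be shown with a toy model. This is a conceptual sketch only, not how the engine is implemented: it ignores the sequence numbers and partition scoping that the Iceberg specification uses to decide which delete files apply to which data files, and all file names and rows are made up.

```python
def merge_on_read(data_files, position_deletes, equality_deletes):
    """Return the live rows after applying delete files at read time.

    data_files: {file_path: [row_dict, ...]}
    position_deletes: set of (file_path, row_position) pairs
    equality_deletes: list of dicts; a row is dropped when it matches
        every column value of any one of them
    """
    live = []
    for path, rows in data_files.items():
        for position, row in enumerate(rows):
            if (path, position) in position_deletes:
                continue  # removed by a position delete
            if any(
                all(row.get(col) == val for col, val in delete.items())
                for delete in equality_deletes
            ):
                continue  # removed by an equality delete
            live.append(row)
    return live


data_files = {
    "a.parquet": [{"id": 1, "status": "new"}, {"id": 2, "status": "new"}],
    "b.parquet": [{"id": 3, "status": "new"}, {"id": 4, "status": "void"}],
}
live_rows = merge_on_read(
    data_files,
    position_deletes={("a.parquet", 1)},      # drops id 2
    equality_deletes=[{"status": "void"}],    # drops id 4
)
print([row["id"] for row in live_rows])  # prints [1, 3]
```

The data files are never rewritten here, which is the trade-off merge-on-read makes: cheaper writes in exchange for extra work on every read until the table is compacted.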

85
Multi-Format

Format Compatibility & Codecs

Supports Iceberg spec v1 & v2 with data files in Parquet (default), ORC, Avro and configurable codecs including SNAPPY, ZSTD, LZ4, GZIP
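As a sketch of how the file format and spec version might be chosen per table, the builder below emits a CREATE TABLE statement using the `format` and `format_version` table properties of the Trino Iceberg connector. The helper name, table, and columns are hypothetical, and codec configuration is not shown.

```python
FILE_FORMATS = {"PARQUET", "ORC", "AVRO"}


def create_table_sql(table, columns, file_format="PARQUET", format_version=2):
    """Build a CREATE TABLE statement with an explicit format and spec version."""
    if file_format not in FILE_FORMATS:
        raise ValueError(f"unsupported file format: {file_format}")
    if format_version not in (1, 2):
        raise ValueError(f"unsupported format version: {format_version}")
    column_list = ",\n  ".join(f"{name} {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE TABLE {table} (\n  {column_list}\n)\n"
        f"WITH (format = '{file_format}', format_version = {format_version})"
    )


# Hypothetical table with two columns, stored as ORC under spec v2.
ddl = create_table_sql(
    "lake.sales.orders",
    [("order_id", "BIGINT"), ("status", "VARCHAR")],
    file_format="ORC",
)
print(ddl)
```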

95
Warp Speed

Performance Optimization Suite

Dynamic filtering, bucket-aware execution, metadata caching, automatic statistics, Warp Speed indexing, and materialized views for enterprise performance

70
Known Constraints

Current Limitations & Roadmap

One catalog per configuration file; Iceberg v3 in preview only; tables with frequent commits need manual optimization; some predicate limitations on nested structs; streaming only through external tools


Starburst Iceberg Feature Matrix

Comprehensive breakdown of Iceberg capabilities in Starburst Enterprise SEP 414-E+

| Dimension | Support Level | Implementation Details | Min Version |
|---|---|---|---|
| Catalog Types | Full (Universal) | Hive, Glue, JDBC, REST, Nessie, Snowflake, Galaxy managed metastore | 414-E+ |
| Read & Write Operations | Full (Complete) | CREATE TABLE, CTAS, INSERT, and all query operations with atomic metadata swap | 414-E+ |
| DML Operations | Full (All Operations) | INSERT, UPDATE, DELETE, MERGE with intelligent partition-aligned predicates | 414-E+ |
| MoR/CoW Storage | Full (Adaptive) | Default CoW for large rewrites; MoR for fine-grained updates with delete files | 414-E+ |
| Time Travel | Full (SQL Native) | FOR VERSION/TIMESTAMP AS OF syntax with metadata tables and procedures | 414-E+ |
| Security & Governance | Full (Enterprise) | Built-in access control, table/column ACLs, LDAP/OAuth, Lake Formation, Ranger | 414-E+ |
| Format Support | v1/v2 (Multi-Format) | Iceberg v1 & v2, Parquet/ORC/Avro, configurable codecs (SNAPPY, ZSTD, LZ4, GZIP) | 414-E+ |
| Performance Optimization | Full (Warp Speed) | Dynamic filtering, bucket execution, metadata caching, auto statistics, Warp Speed | 414-E+ |
| Materialized Views | Full (Incremental) | Materialized views with incremental refresh for performance optimization | 414-E+ |
| Streaming Support | None (External Only) | No built-in streaming; queries snapshots from external tools | N/A |
| Iceberg v3 Support | Preview (Read-Only) | v3 preview metadata reading under feature flag; production GA roadmap 2025 | 430+ |
| Catalog Configuration | Limited (One Per File) | One catalog configuration per connector file; multiple connectors for multi-catalog | 414-E+ |


Use Cases

Enterprise Data Warehousing

Comprehensive analytics platform with full DML capabilities and enterprise governance

  • Large-scale data warehousing with complex transformations
  • Multi-tenant environments with strict security requirements
  • Enterprise reporting and business intelligence platforms
  • Data governance scenarios requiring detailed access control

Multi-Cloud & Hybrid Analytics

Unified analytics across diverse catalog and storage environments

  • Multi-cloud deployments with different catalog systems
  • Hybrid on-premises and cloud data architectures
  • Migration scenarios from legacy systems to modern lakehouses
  • Federation across multiple metadata and storage systems

High-Performance Analytics

Performance-critical workloads requiring sub-second response times

  • Interactive business intelligence with large datasets
  • Real-time dashboards and operational analytics
  • Complex analytical queries with advanced optimizations
  • Performance-sensitive applications with SLA requirements

Compliance & Audit Scenarios

Regulatory environments requiring comprehensive audit trails and access control

  • Financial services regulatory reporting
  • Healthcare data governance and compliance
  • Audit trail requirements with time travel capabilities
  • Data lineage and governance for compliance frameworks
