Starburst Enterprise SEP 414-E+
End-to-end Iceberg analytics platform with comprehensive catalog support, full DML operations, enterprise governance, and advanced optimization features
Key Features
Comprehensive Catalog Support
Hive Metastore, AWS Glue, JDBC, REST, Nessie, Snowflake, and Starburst Galaxy managed metastore with flexible configuration via iceberg.catalog.type
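To illustrate the iceberg.catalog.type setting, the Python sketch below renders the two core lines of a catalog properties file. The helper name render_catalog_properties is hypothetical. The value spellings are assumed to follow the open-source Trino Iceberg connector. The value for the Starburst Galaxy managed metastore and the per-metastore connection properties are deliberately left out and should be taken from the SEP documentation.

```python
# Hypothetical helper: renders the minimal text of an Iceberg catalog
# properties file. Value spellings are an assumption based on the
# open-source Trino Iceberg connector; the Galaxy metastore value and all
# connection-specific properties are intentionally omitted.

CATALOG_TYPES = {"hive_metastore", "glue", "jdbc", "rest", "nessie", "snowflake"}


def render_catalog_properties(catalog_type: str) -> str:
    """Return the two core lines of a catalog properties file."""
    if catalog_type not in CATALOG_TYPES:
        raise ValueError(f"unsupported iceberg.catalog.type: {catalog_type}")
    lines = [
        "connector.name=iceberg",
        f"iceberg.catalog.type={catalog_type}",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_catalog_properties("glue"), end="")
```

Each metastore type needs further properties (endpoints, credentials, warehouse location) before the catalog will start, so treat the output as a starting point only.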
Complete DML Operations
INSERT, UPDATE, DELETE, MERGE all supported with intelligent partition-aligned predicates and Iceberg v2 position/equality-delete files
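As one possible illustration of the MERGE support, the following Python sketch assembles an upsert statement in Trino-style SQL. The function build_merge and the table and column names in the usage example are hypothetical. The statement shape (MERGE INTO ... USING ... ON ... WHEN MATCHED / WHEN NOT MATCHED) follows the standard Trino MERGE syntax.

```python
import re

# Plain identifiers only; this keeps the sketch safe from SQL injection
# at the cost of rejecting names that would need quoting.
_COLUMN = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")
_TABLE = re.compile(r"[A-Za-z_][A-Za-z0-9_]*(\.[A-Za-z_][A-Za-z0-9_]*){0,2}")


def _check(pattern: re.Pattern, name: str) -> str:
    if not pattern.fullmatch(name):
        raise ValueError(f"unsupported identifier: {name!r}")
    return name


def build_merge(target: str, source: str, key: str, columns: list[str]) -> str:
    """Build an upsert: update rows that match on `key`, insert the rest."""
    target = _check(_TABLE, target)
    source = _check(_TABLE, source)
    key = _check(_COLUMN, key)
    columns = [_check(_COLUMN, c) for c in columns]

    set_clause = ", ".join(f"{c} = s.{c}" for c in columns)
    insert_cols = ", ".join([key, *columns])
    insert_vals = ", ".join(f"s.{c}" for c in [key, *columns])
    return (
        f"MERGE INTO {target} t USING {source} s ON t.{key} = s.{key}\n"
        f"WHEN MATCHED THEN UPDATE SET {set_clause}\n"
        f"WHEN NOT MATCHED THEN INSERT ({insert_cols}) VALUES ({insert_vals})"
    )


if __name__ == "__main__":
    # Hypothetical catalog, schema, and table names.
    print(build_merge("lake.sales.orders", "lake.staging.order_updates",
                      "order_id", ["status", "total"]))
```

Whether a given statement produces rewritten data files or delete files depends on the engine and table settings; the sketch only covers the SQL text.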
Enterprise Security & Governance
Built-in access-control engine with table/column-level ACLs, LDAP/OAuth integration, and support for Lake Formation and HMS Ranger policies
Advanced Time Travel
Query past snapshots using FOR VERSION AS OF or FOR TIMESTAMP AS OF with metadata tables ($snapshots, $history, $manifests) and maintenance procedures
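The time travel clauses and the $snapshots metadata table can be sketched as follows. This is a minimal Python example that only builds the SQL text. The helper names and the sales.orders table are hypothetical, and the selected $snapshots columns (snapshot_id, committed_at, operation) are assumed to match the open-source Trino Iceberg connector.

```python
from datetime import datetime, timezone


def quote_ident(name: str) -> str:
    """Double-quote a SQL identifier, escaping embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def snapshots_query(schema: str, table: str) -> str:
    """List snapshots via the $snapshots metadata table (name must be quoted)."""
    return (
        "SELECT snapshot_id, committed_at, operation "
        f"FROM {quote_ident(schema)}.{quote_ident(table + '$snapshots')} "
        "ORDER BY committed_at"
    )


def version_query(schema: str, table: str, snapshot_id: int) -> str:
    """Read the table as of a specific snapshot ID."""
    return (
        f"SELECT * FROM {quote_ident(schema)}.{quote_ident(table)} "
        f"FOR VERSION AS OF {int(snapshot_id)}"
    )


def timestamp_query(schema: str, table: str, at: datetime) -> str:
    """Read the table as of a point in time; `at` must be timezone-aware."""
    if at.tzinfo is None:
        raise ValueError("timestamp must be timezone-aware")
    literal = at.astimezone(timezone.utc).strftime("%Y-%m-%d %H:%M:%S") + " UTC"
    return (
        f"SELECT * FROM {quote_ident(schema)}.{quote_ident(table)} "
        f"FOR TIMESTAMP AS OF TIMESTAMP '{literal}'"
    )


if __name__ == "__main__":
    print(snapshots_query("sales", "orders"))
    print(version_query("sales", "orders", 123))
    print(timestamp_query("sales", "orders", datetime(2024, 1, 1, tzinfo=timezone.utc)))
```

A typical workflow would run the first query to find a snapshot_id, then pass that ID to the FOR VERSION AS OF query.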
Adaptive Storage Strategies
Copy-on-write (CoW) is the default for large rewrites; fine-grained updates instead write separate delete files (merge-on-read, MoR) that are merged at query time. Both position and equality deletes are handled
Format Compatibility & Codecs
Supports Iceberg spec v1 & v2 with data files in Parquet (default), ORC, Avro and configurable codecs including SNAPPY, ZSTD, LZ4, GZIP
Performance Optimization Suite
Dynamic filtering, bucket-aware execution, metadata caching, automatic statistics, Warp Speed indexing, and materialized views for enterprise performance
Current Limitations & Roadmap
One catalog per configuration file; Iceberg v3 is preview only; tables with frequent commits need manual optimization; some limitations on predicates over nested struct fields; streaming only via external tools
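Manual optimization for tables with frequent commits usually means periodic compaction and snapshot expiry. The Python sketch below builds the two statements, assuming the optimize and expire_snapshots table procedures of the open-source Trino Iceberg connector are available in the deployment. The helper names, the table name, and the default thresholds are illustrative assumptions, not recommended values.

```python
import re

_TABLE = re.compile(r"[A-Za-z_][A-Za-z0-9_]*(\.[A-Za-z_][A-Za-z0-9_]*){0,2}")
_DATA_SIZE = re.compile(r"\d+(\.\d+)?(B|kB|MB|GB|TB)")
_DURATION = re.compile(r"\d+(\.\d+)?(ns|us|ms|s|m|h|d)")


def _require(pattern: re.Pattern, value: str, what: str) -> str:
    """Reject values that do not match the expected shape."""
    if not pattern.fullmatch(value):
        raise ValueError(f"invalid {what}: {value!r}")
    return value


def optimize_statement(table: str, file_size_threshold: str = "128MB") -> str:
    """Compact data files smaller than the threshold into larger ones."""
    table = _require(_TABLE, table, "table name")
    size = _require(_DATA_SIZE, file_size_threshold, "data size")
    return f"ALTER TABLE {table} EXECUTE optimize(file_size_threshold => '{size}')"


def expire_snapshots_statement(table: str, retention: str = "7d") -> str:
    """Drop snapshots older than the retention threshold."""
    table = _require(_TABLE, table, "table name")
    keep = _require(_DURATION, retention, "duration")
    return f"ALTER TABLE {table} EXECUTE expire_snapshots(retention_threshold => '{keep}')"


if __name__ == "__main__":
    # Hypothetical table name; run the output on a schedule of your choosing.
    print(optimize_statement("lake.sales.orders"))
    print(expire_snapshots_statement("lake.sales.orders"))
```

Expiring snapshots shortens the time travel window, so the retention threshold should be chosen with any audit requirements in mind.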
Starburst Iceberg Feature Matrix
Comprehensive breakdown of Iceberg capabilities in Starburst Enterprise SEP 414-E+
Dimension | Support Level | Implementation Details | Min Version |
---|---|---|---|
Catalog Types | Full (Universal) | Hive, Glue, JDBC, REST, Nessie, Snowflake, Galaxy managed metastore | 414-E+ |
Read & Write Operations | Full (Complete) | CREATE TABLE, CTAS, INSERT, and all query operations with atomic metadata swap | 414-E+ |
DML Operations | Full (All Operations) | INSERT, UPDATE, DELETE, MERGE with intelligent partition-aligned predicates | 414-E+ |
MoR/CoW Storage | Full (Adaptive) | Default CoW for large rewrites; MoR for fine-grained updates with delete files | 414-E+ |
Time Travel | Full (SQL Native) | FOR VERSION/TIMESTAMP AS OF syntax with metadata tables and procedures | 414-E+ |
Security & Governance | Full (Enterprise) | Built-in access control, table/column ACLs, LDAP/OAuth, Lake Formation, Ranger | 414-E+ |
Format Support | v1/v2 (Multi-Format) | Iceberg v1 & v2, Parquet/ORC/Avro, configurable codecs (SNAPPY, ZSTD, LZ4, GZIP) | 414-E+ |
Performance Optimization | Full (Warp Speed) | Dynamic filtering, bucket execution, metadata caching, auto statistics, Warp Speed | 414-E+ |
Materialized Views | Full (Incremental) | Materialized views with incremental refresh for performance optimization | 414-E+ |
Streaming Support | None (External Only) | No built-in streaming; queries snapshots from external tools | N/A |
Iceberg v3 Support | Preview (Read-Only) | v3 preview metadata reading under feature flag; production GA roadmap 2025 | 430+ |
Catalog Configuration | Limited (One Per File) | One catalog configuration per connector file; multiple connectors for multi-catalog | 414-E+ |
Use Cases
Enterprise Data Warehousing
Comprehensive analytics platform with full DML capabilities and enterprise governance
- Large-scale data warehousing with complex transformations
- Multi-tenant environments with strict security requirements
- Enterprise reporting and business intelligence platforms
- Data governance scenarios requiring detailed access control
Multi-Cloud & Hybrid Analytics
Unified analytics across diverse catalog and storage environments
- Multi-cloud deployments with different catalog systems
- Hybrid on-premises and cloud data architectures
- Migration scenarios from legacy systems to modern lakehouses
- Federation across multiple metadata and storage systems
High-Performance Analytics
Performance-critical workloads requiring sub-second response times
- Interactive business intelligence with large datasets
- Real-time dashboards and operational analytics
- Complex analytical queries with advanced optimizations
- Performance-sensitive applications with SLA requirements
Compliance & Audit Scenarios
Regulatory environments requiring comprehensive audit trails and access control
- Financial services regulatory reporting
- Healthcare data governance and compliance
- Audit trail requirements with time travel capabilities
- Data lineage and governance for compliance frameworks