Skip to main content

What are Catalogs?

A catalog is a entity from where OLake Fusion fetches Iceberg tables. It stores reference to the latest metadata file which stores info such as table names, schemas, and file locations. OLake Fusion must know which catalog (and database) a table belongs to before maintenance can run on it.

Catalogs are managed from the Maintenance section in the OLake UI. There are two types: OLake Imported Catalogs and External Catalogs.

OLake Imported Catalogs​

OLake Imported Catalogs are catalog entries whose credentials come from an Iceberg destination already configured in OLake Ingestion—imported into Maintenance instead of re-entering connection details.

When Iceberg ingestion is already set up, that destination's catalog can be imported this way.

How to add an OLake Imported Catalog​

  1. In the OLake UI sidebar, open the Maintenance dropdown and go to the Catalogs page.

  2. Click New Catalog. Catalogs page

  3. Under Import Catalog from destination, open the dropdown and select the destination. OLake Fusion imports all catalog credentials from that destination. Catalog dropdown

    JDBC Catalog — Catalog Name Must Match

    For JDBC catalogs, the catalog name must match the one used during Ingestion. If no catalog name was provided during ingestion, the default is olake_iceberg.

  4. Click Connect to save and validate the catalog. Save Catalog

Note: Duplicate catalog names are not allowed. If a catalog with the same name already exists, the new catalog will not be added.

On the Catalogs page, OLake Imported Catalogs are shown with OLake beside their name to distinguish them from External Catalogs.

OLake Imported Catalog with OLake label on Catalogs page

External Catalogs​

External Catalogs are catalogs added by entering connection details manually from the Catalogs page (without using Import Catalog from destination).

Use an external catalog when:

  • Maintenance is needed on Iceberg tables created outside OLake.
  • The same catalog is used by other systems and OLake Fusion should compact those tables without replicating data via Ingestion.

How to add an External Catalog​

  1. In the OLake UI sidebar, open the Maintenance dropdown and go to the Catalogs page.
  2. Click New Catalog.
  3. Enter the catalog details manually (for example, catalog name, type, and connection settings). Do not use Import Catalog from destination.

Add catalog view

  1. Click Connect to save and validate the catalog.

Catalog connected view

Once a catalog is connected—whether imported or added as external—it appears in the Select Catalog dropdown on the Tables page. Selecting a catalog (and then a database) lists the tables available for maintenance configuration.

Compatibility to Iceberg Catalogs​

OLake Fusion supports multiple Iceberg catalog implementations, including REST catalog, Hive Metastore, and JDBC Catalog, letting you choose the one that best fits your environment. The table below shows the supported catalogs at a glance, with links to their setup guides.

Catalog
AWS Glue logoAWS Glue
REST (Generic)
Nessie logoREST Nessie
Polaris logoREST Polaris
Unity Catalog logoREST Unity
Lakekeeper logoREST Lakekeeper
Amazon S3 logoS3 Tables
JDBC logoJDBC
Apache Hive logoHive Metastore

For a full walkthrough that includes adding a catalog and configuring the first table maintenance, see Configure Your First Table Maintenance.



đź’ˇ Join the OLake Community!

Got questions, ideas, or just want to connect with other data engineers?
👉 Join our Slack Community to get real-time support, share feedback, and shape the future of OLake together. 🚀

Your success with OLake is our priority. Don’t hesitate to contact us if you need any help or further clarification!