source.json - Parquet Format
{
  "bucket_name": "source-data",
  "region": "us-east-1",
  "path_prefix": "parquet/",
  "access_key_id": "minioadmin",
  "secret_access_key": "minioadmin",
  "endpoint": "http://localhost:9000",
  "file_format": "parquet",
  "max_threads": 5,
  "retry_count": 3
}
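
Before handing a file like this to a sync job, it can help to sanity-check it. The following is a minimal sketch in Python, not part of OLake itself: the function name validate_source_config is hypothetical, and the choice of which keys are required and which numeric bounds apply is an assumption made for illustration, based only on the fields shown in the example above.

```python
import json

# Keys taken from the example config above. Treating these as required is an
# assumption of this sketch, not documented OLake behavior.
REQUIRED_KEYS = {"bucket_name", "region", "path_prefix", "file_format"}


def validate_source_config(raw: str) -> dict:
    """Parse a source.json string and run basic sanity checks on it."""
    config = json.loads(raw)

    missing = REQUIRED_KEYS - config.keys()
    if missing:
        raise ValueError(f"missing keys: {sorted(missing)}")

    if config["file_format"] != "parquet":
        raise ValueError(f"expected file_format 'parquet', got {config['file_format']!r}")

    # Assumed bounds: at least one worker thread, no negative retry count.
    for key, minimum in (("max_threads", 1), ("retry_count", 0)):
        value = config.get(key)
        if value is None:
            continue
        if isinstance(value, bool) or not isinstance(value, int) or value < minimum:
            raise ValueError(f"{key} must be an integer >= {minimum}")

    return config


EXAMPLE = """
{
  "bucket_name": "source-data",
  "region": "us-east-1",
  "path_prefix": "parquet/",
  "access_key_id": "minioadmin",
  "secret_access_key": "minioadmin",
  "endpoint": "http://localhost:9000",
  "file_format": "parquet",
  "max_threads": 5,
  "retry_count": 3
}
"""

config = validate_source_config(EXAMPLE)
print(config["bucket_name"], config["file_format"])
```

The check fails fast with a ValueError that names the offending key, which is usually easier to act on than a failure in the middle of a sync.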

Parquet Features:

  • Schema is read automatically from Parquet file metadata
  • No additional format-specific configuration required
  • Supports all Parquet data types
  • Efficient streaming for large files using S3 range requests
  • Native columnar reading preserves type information
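
To illustrate why range requests suit Parquet, recall the file layout: a Parquet file ends with the footer metadata (which holds the schema), followed by a 4-byte little-endian footer length and the 4-byte magic "PAR1". A reader can therefore fetch the last 8 bytes, compute where the footer starts, and request only that range. The sketch below shows the arithmetic in Python; it is an illustration of the general technique, not OLake's implementation, and the helper names footer_byte_range and range_header are hypothetical.

```python
import struct

PARQUET_MAGIC = b"PAR1"
TAIL_SIZE = 8  # 4-byte little-endian footer length + 4-byte magic


def footer_byte_range(file_size: int, tail: bytes) -> tuple[int, int]:
    """Return the inclusive (start, end) byte offsets of the footer metadata.

    `tail` must be the last 8 bytes of the file, e.g. fetched with a
    suffix range request for the final 8 bytes of the object.
    """
    if len(tail) != TAIL_SIZE or tail[4:] != PARQUET_MAGIC:
        raise ValueError("tail does not end with the Parquet magic bytes")

    (footer_len,) = struct.unpack("<I", tail[:4])
    if footer_len == 0:
        raise ValueError("footer length is zero")

    start = file_size - TAIL_SIZE - footer_len
    if start < len(PARQUET_MAGIC):
        raise ValueError("footer length exceeds file size")

    end = file_size - TAIL_SIZE - 1  # HTTP ranges are inclusive
    return start, end


def range_header(start: int, end: int) -> str:
    """Format an HTTP Range header value for an inclusive byte range."""
    return f"bytes={start}-{end}"


# Build a fake file with the Parquet framing (the payload is not real data).
fake_footer = b"m" * 20
fake_file = PARQUET_MAGIC + b"x" * 100 + fake_footer + struct.pack("<I", 20) + PARQUET_MAGIC

start, end = footer_byte_range(len(fake_file), fake_file[-TAIL_SIZE:])
print(range_header(start, end))  # bytes=104-123
```

With the footer in hand, a reader knows the schema and the offsets of each column chunk, so later requests can target only the byte ranges it needs instead of downloading the whole object.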

