Summary
This 60-minute OLake panel brings together six accomplished women data engineers from healthcare, retail, cloud platforms, and large-scale enterprise systems to dissect the technical challenges behind modern data pipelines. They will cover everything from multi-terabyte CDC architectures and real-time analytics to cloud-native cost optimization, showing how sound engineering choices drive tangible business outcomes.
Chapters & Topics
Welcome and Session Overview
Moderator introduces the six panelists, outlines the focus on deep technical practices rather than high-level theory, and sets expectations for a fast-paced, Q&A-heavy hour.
Domain-Specific Pipelines
Panelists walk through specialized solutions: HIPAA-compliant clinical-trial ingestion, sub-second retail clickstream analytics, and cloud-agnostic data platforms on AWS, GCP, and Azure.
Performance Engineering Wins
Concrete examples of moving ETL from batch to streaming, shaving hours off jobs, and designing low-latency CDC paths that scale to petabytes.
Engineer’s Technical Toolkit
Progression from SQL and Python foundations to distributed-systems design, with opinions on when to generalize versus specialize in areas like Spark, Databricks, or Apache Iceberg.
Business Impact of Technical Choices
How storage-format decisions, cloud-migration tactics, and real-time architectures translate into cost savings, faster feature delivery, and measurable revenue gains.
Audience Q&A and Closing Thoughts
Live questions on tooling trade-offs, career pathways, and the future of data engineering; panelists share actionable next steps for attendees.
Action Items
- Organizers to share session recording and slide deck with registrants after the event.