Live Virtual Conference
April 29, 2026 | 9am - 3pm PT
Join the year’s premier education event
on open data architectures for data practitioners
Lineup
- 2 tracks of expert-led content
- Panels
- Workshops
- Technical sessions
Topics
- AI-native data platforms
- Data engineering for AI
- Cost and performance optimization at scale
- Maximizing openness and interoperability in your data stack
Audience
- Data & AI practitioners
- Data engineers
- Data architects
- Data platform engineers
- Analytics engineers
Speakers

Başak Tuğçe Eskili
Machine Learning Engineer


Tosh Rayadhurgam
Head of Advanced AI


Ruiyang Wang
Member of Technical Staff


Vamshi Pasunuru
Staff Software Engineer


Junping (JP) Du
CoFounder & CEO


Vinoth Chandar
CEO


Maxime Beauchemin
CEO


Simba Khadder
Head of Context Engine


Fei Han
Director of Real-Time Data Platform


Andrii Loievets
Staff Software Engineer


Revanth Chandupatla
Principal Engineer


Holden Karau
Principal Engineer OSS Spark


Satej Kumar Sahu
Principal Data Engineer


Kevin Liu
Principal Engineer


Aditi Pandit
Principal Engineer


Julien Le Dem
Principal Engineer


Mehul Batra
Software Engineer


Xinli Shang
Senior Staff Software Engineer


Kyle Weller
VP of Product


Yufei Gu
Staff Software Engineer


Rui Mo
Software Engineer


Dipankar Mazumdar
Director - Developers (Data/AI)


Junping Du
Co-Founder and CEO


Will Manning
Co-founder & CEO


Suman Debnath
Technical Lead (ML)


Chang She
CEO


Will Angel
AI Engineer

.jpg)
Yuxia Luo
Software Engineer


Rahil Chertara
Senior Software Engineer


Tim Meehan
Staff Software Engineer

Select Keynotes
From Lakehouse to Agent Infrastructure: Data Platforms for the Age of Autonomous AI
Vinoth Chandar
,
Onehouse
,
,
,
Track 1
Lorem Ipsum Dolor Sit Amet
3:00 PM
–
3:25 PM
PST
Safe PDF Processing at Scale: A Rasterize-First Architecture
Ruiyang Wang
,
Anthropic
,
,
1:55 PM
–
2:20 PM
PST
Scalable Table Services @Uber
Vamshi Pasunuru
,
Uber
Xinli Shang
,
Uber
,
2:20 PM
–
2:35 PM
PST
Building multi-tenant, multi-cloud Streaming Engines at Fortune one scale
Revanth Chandupatla
,
Walmart
,
,
10:50 AM
–
11:15 AM
PST
How Conductor transformed their data layer with Apache Hudi, Onehouse and Starrocks
Andrii Loievets
,
Conductor
,
,
10:25 AM
–
10:50 AM
PST
The Latest Architecture Evolution of Apache Hudi at JD.com
Fei Han
,
JD
,
,
11:15 AM
–
11:30 AM
PST
Booking.com's ultra-low latency feature platform
Başak Tuğçe Eskili
,
Booking
,
,
12:25 PM
–
12:50 PM
PST
What Happens to Your Data Architecture When Query Layer Starts Making Decisions
Tosh Rayadhurgam
,
ex Meta
,
,
10:00 AM
–
10:25 AM
PST
Guardrails for Agentic AI: Governing Auto-Generated SQL and Spark Jobs Before Production
Satej Kumar Sahu
,
Zalando
,
,
12:50 PM
–
1:15 PM
PST
Vortex: Building GPU-Native Columnar Storage
Will Manning
,
Spiral (SpiralDB)
,
,
12:10 PM
–
12:25 PM
PST
Building a Personal Data Lakehouse
Will Angel
,
DroneDeploy
,
,
2:35 PM
–
3:00 PM
PST
Anatomy of our Data Agent: How AI Support Analytics at Preset
Maxime Beauchemin
,
Preset
,
,
Track 2
Lorem Ipsum Dolor Sit Amet
1:55 PM
–
2:20 PM
PST
What’s new in Spark 4.2 / 4.3 and how to optimize your UDFS in Spark 4+
Holden Karau
,
Snowflake
,
,
,
12:10 PM
–
12:25 PM
PST
Driving Iceberg Adoption with Open Catalog and Open Datasets
Kevin Liu
,
Microsoft
,
,
,
10:25 AM
–
10:50 AM
PST
Column Storage for the AI era
Julien Le Dem
,
Datadog
,
,
,
11:55 AM
–
12:10 PM
PST
Lake, Stream, and Everything In Between: Apache Fluss and the Streaming Lakehouse
Mehul Batra
,
DigitalOcean
,
,
,
12:50 PM
–
1:15 PM
PST
Polaris Meets Hudi, Unifying Lakehouse Metadata Across Table Formats
Yufei Gu
,
Snowflake
,
,
,
10:50 AM
–
11:15 AM
PST
Apache Gluten: Delivering Continuous Innovation in Big Data Analytics
Rui Mo
,
IBM
,
,
,
What is Really "Open" in an Open Lakehouse Architecture?
Dipankar Mazumdar
,
Cloudera
,
,
,
2:20 PM
–
2:35 PM
PST
Metadata as the Control Plane: The Foundation of an AI-Native Data Platform
Junping (JP) Du
,
Datastrato
,
,
,
11:40 AM
–
11:55 AM
PST
The Physics of LLM Inference at Scale
Suman Debnath
,
Anyscale
,
,
,
12:25 PM
–
12:50 PM
PST
Managing Data at Exabyte Scale for AI Model Training
Chang She
,
LanceDB
,
,
,
2:35 PM
–
3:00 PM
PST
Apache Hudi™ for the next generation of AI: Unstructured Data and Vector Search on open lakehouse storage
Rahil Chertara
,
Onehouse
Timothy Brown
,
General Intuiton
,
,
3:35PM
–
4:00PM
PST
Open Data using Onehouse Cloud
If you've ever tried to build a data lakehouse, you know it's no small task. You've got to tie...
Chandra Krishnan
,
Onehouse
Register Now
Secure your spot at the premier data practitioner event! Don’t miss out on expert insights, hands-on workshops, and networking opportunities.
Apply to speak to our next edition
Are you a data practitioner with real-world lessons from designing, building, or operating the open data stack? We’d love to hear from you.
Themes we’re excited about:
- AI-native data platforms
- Data engineering for AI
- Cost and performance optimization at scale
- Maximizing openness and interoperability in your data stack






























