Live Virtual Conference
April 29, 2026 | 9am - 3pm PT
Join the year’s premier education event on open data architectures for data practitioners
Lineup
- 2 tracks of expert-led content
- Panels
- Workshops
- Technical sessions
Topics
- AI-native data platforms
- Data engineering for AI
- Cost and performance optimization at scale
- Maximizing openness and interoperability in your data stack
Audience
- Data & AI practitioners
- Data engineers
- Data architects
- Data platform engineers
- Analytics engineers
Speakers

Başak Tuğçe Eskili
Machine Learning Engineer


Tosh Rayadhurgam
Head of Advanced AI


Ruiyang Wang
Member of Technical Staff


Vamshi Pasunuru
Staff Software Engineer


Junping (JP) Du
Co-Founder & CEO


Vinoth Chandar
CEO


Maxime Beauchemin
CEO


Simba Khadder
Head of Context Engine


Fei Han
Director of Real-Time Data Platform


Andrii Loievets
Staff Software Engineer


Revanth Chandupatla
Principal Engineer


Holden Karau
Principal Engineer OSS Spark


Sagar Lakshmipathy
Solutions Engineer


Satej Kumar Sahu
Principal Data Engineer


Kevin Liu
Principal Engineer


Aditi Pandit
Principal Engineer


Julien Le Dem
Principal Engineer


Mehul Batra
Software Engineer


Xinli Shang
Senior Staff Software Engineer


Kyle Weller
VP of Product


Yufei Gu
Staff Software Engineer


Rui Mo
Software Engineer


Dipankar Mazumdar
Director - Developers (Data/AI)


Will Manning
Co-founder & CEO


Suman Debnath
Technical Lead (ML)


Chang She
CEO


Will Angel
AI Engineer


Alex Jones
Tech Lead, ML Platform

Yuxia Luo
Software Engineer


Rahil Chertara
Senior Software Engineer


Tim Meehan
Staff Software Engineer


Timothy Brown
Database Engineer
Select Keynotes
9:30 AM – 10:00 AM PST
From Lakehouse to Agent Infrastructure: Data Platforms for the Age of Autonomous AI
Vinoth Chandar, Onehouse
Track 1
10:00 AM – 10:25 AM PST
Guardrails for Agentic AI: Governing Auto-Generated SQL and Spark Jobs Before Production
Satej Kumar Sahu, Zalando
10:25 AM – 10:50 AM PST
The Latest Architecture Evolution of Apache Hudi at JD.com
Fei Han, JD
10:50 AM – 11:15 AM PST
How Conductor transformed their data layer with Apache Hudi, Onehouse and StarRocks
Andrii Loievets, Conductor
11:15 AM – 11:30 AM PST
Booking.com's ultra-low latency feature platform
Başak Tuğçe Eskili, Booking
12:10 PM – 12:25 PM PST
Building a Personal Data Lakehouse
Will Angel, DroneDeploy
12:25 PM – 12:50 PM PST
What Happens to Your Data Architecture When Query Layer Starts Making Decisions
Tosh Rayadhurgam, ex Meta
12:50 PM – 1:15 PM PST
Vortex: Building GPU-Native Columnar Storage
Will Manning, Spiral (SpiralDB)
1:55 PM – 2:20 PM PST
Scalable Table Services @Uber
Vamshi Pasunuru, Uber
Xinli Shang, Uber
2:20 PM – 2:35 PM PST
Building multi-tenant, multi-cloud Streaming Engines at biggest retailer on this planet scale
Revanth Chandupatla, Walmart
2:35 PM – 3:00 PM PST
Anatomy of our Data Agent: How AI Support Analytics at Preset
Maxime Beauchemin, Preset
3:00 PM – 3:25 PM PST
Safe PDF Processing at Scale: A Rasterize-First Architecture
Ruiyang Wang, Anthropic
Track 2
10:00 AM – 10:25 AM PST
Building a Context Engine: Data Pipelines for Agents
Simba Khadder, Redis
10:25 AM – 10:50 AM PST
Column Storage for the AI era
Julien Le Dem, Datadog
10:50 AM – 11:15 AM PST
Apache Gluten: Delivering Continuous Innovation in Big Data Analytics
Rui Mo, IBM
11:15 AM – 11:30 AM PST
What is Really "Open" in an Open Lakehouse Architecture?
Dipankar Mazumdar, Cloudera
11:40 AM – 11:55 AM PST
The Physics of LLM Inference at Scale
Suman Debnath, Anyscale
11:55 AM – 12:10 PM PST
Lake, Stream, and Everything In Between: Apache Fluss and the Streaming Lakehouse
Mehul Batra, DigitalOcean
12:10 PM – 12:25 PM PST
Driving Iceberg Adoption with Open Catalog and Open Datasets
Kevin Liu, Microsoft
12:25 PM – 12:50 PM PST
Managing Data at Exabyte Scale for AI Model Training
Chang She, LanceDB
12:50 PM – 1:15 PM PST
Polaris Meets Hudi, Unifying Lakehouse Metadata Across Table Formats
Yufei Gu, Snowflake
1:55 PM – 2:20 PM PST
What’s new in Spark 4.2 / 4.3 and how to optimize your UDFs in Spark 4+
Holden Karau, Snowflake
2:20 PM – 2:35 PM PST
Metadata as the Control Plane: The Foundation of an AI-Native Data Platform
Junping (JP) Du, Datastrato
2:35 PM – 3:00 PM PST
Apache Hudi™ for the next generation of AI: Unstructured Data and Vector Search on open lakehouse storage
Rahil Chertara, Onehouse
Timothy Brown, General Intuition
3:00 PM – 3:25 PM PST
The latest on the Presto Native Engine
Aditi Pandit, IBM
3:35 PM – 4:00 PM PST
Open Data using Onehouse Cloud
If you've ever tried to build a data lakehouse, you know it's no small task. You've got to tie...
Chandra Krishnan, Onehouse
Workshop
10:00 AM – 10:25 AM PST and 3:35 PM – 4:00 PM PST
Workshop: Supercharge Apache Spark on Kubernetes with the Quanton Operator
If you've ever tried to build a data lakehouse, you know it's no small task. You've got to tie...
Sagar Lakshmipathy, Onehouse
Register Now
Secure your spot at the premier data practitioner event! Don’t miss out on expert insights, hands-on workshops, and networking opportunities.
Apply to speak at our next edition (Fall 2026)
Are you a data practitioner with real-world lessons from designing, building, or operating the open data stack? We’d love to hear from you.
Themes we’re excited about:
- AI-native data platforms
- Data engineering for AI
- Cost and performance optimization at scale
- Maximizing openness and interoperability in your data stack
