Live Virtual Conference
April 29, 2026 | 9am - 3pm PT
Join the year’s premier education event
on open data architectures for data practitioners
Lineup
- 2 tracks of expert-led content
- Panels
- Workshops
- Technical sessions
Topics
- AI-native data platforms
- Data engineering for AI
- Cost and performance optimization at scale
- Maximizing openness and interoperability in your data stack
Audience
- Data & AI practitioners
- Data engineers
- Data architects
- Data platform engineers
- Analytics engineers
Speakers

Başak Tuğçe Eskili
Machine Learning Engineer


Tosh Rayadhurgam
Engineering Leader - Ranking & Foundational AI


Ruiyang Wang
Technical Staff


Vamshi Pasunuru
Staff Software Engineer


Junping (JD) Du
CoFounder & CEO


Vinoth Chandar
CEO


Maxime Beauchemin
CEO


Simba Khadder
Context Engine


Fei Han
Director of Real-Time Data Platform


Andrii Loievets
Director Software Engineering


Revanth Chandupatla
Principal Engineer


Holden Karau
Open Source Engineer


Satej Kumar Sahu
Principal Data Engineer


Kevin Liu
Principal Software Engineer


Aditi Pandit
Principal Engineer


Julien Le Dem
Principal Engineer


Mehul Batra
Software Engineer


Xinli Shang
Senior Staff Software Engineer


Kyle Weller
VP of Product


Yufei Gu
Staff Software Engineer


Rui Mo
Software Engineer


Dipankar Mazumdar
Director - Developers (Data/AI)


Junping Du
CoFounder & CEO


Will Manning
Co-founder & CEO


Suman Debnath
Technical Lead (ML)


Chang She
CEO


Will Angel
AI Engineer

.jpg)
Yuxia Luo
Software Engineer


Rahil Chertara
Senior Software Engineer


Tim Meehan
Software Engineer

Select Keynotes
From Lakehouse to Agent Infrastructure: Data Platforms for the Age of Autonomous AI
Vinoth Chandar
,
Onehouse
,
,
,
Track 1
Lorem Ipsum Dolor Sit Amet
Safe PDF Processing at Scale: A Rasterize-First Architecture
Ruiyang Wang
,
Anthropic
,
,
Scalable Table Services @Uber
Vamshi Pasunuru
,
Uber
Xinli Shang
,
Uber
,
Building multi-tenant, multi-cloud Streaming Engines at Fortune one scale
Revanth Chandupatla
,
Walmart
,
,
How Conductor transformed their data layer with Apache Hudi, Onehouse and Starrocks
Andrii Loievets
,
Conductor
,
,
The Latest Architecture Evolution of Apache Hudi at JD.com
Fei Han
,
JD
,
,
Guardrails for Agentic AI: Governing Auto-Generated SQL and Spark Jobs Before Production
Satej Kumar Sahu
,
Zalando
,
,
Vortex: Building GPU-Native Columnar Storage
Will Manning
,
Spiral (SpiralDB)
,
,
Building a Personal Data Lakehouse
Will Angel
,
DroneDeploy
,
,
Booking.com's ultra-low latency feature platform
Başak Tuğçe Eskili
,
Booking
,
,
What Happens to Your Data Architecture When Query Layer Starts Making Decisions
Tosh Rayadhurgam
,
Meta
,
,
Anatomy of our Data Agent: How AI Support Analytics at Preset
Maxime Beauchemin
,
Preset
,
,
Track 2
Lorem Ipsum Dolor Sit Amet
What’s new in Spark 4.2 / 4.3 and how to optimize your UDFS in Spark 4+
Holden Karau
,
Snowflake
,
,
,
Driving Iceberg Adoption with Open Catalog and Open Datasets
Kevin Liu
,
Microsoft
,
,
,
Column Storage for the AI era
Julien Le Dem
,
Datadog
,
,
,
Lake, Stream, and Everything In Between: Apache Fluss and the Streaming Lakehouse
Mehul Batra
,
DigitalOcean
,
,
,
Polaris Meets Hudi, Unifying Lakehouse Metadata Across Table Formats
Yufei Gu
,
Snowflake
,
,
,
Apache Gluten: Delivering Continuous Innovation in Big Data Analytics
Rui Mo
,
IBM
,
,
,
What is Really "Open" in an Open Lakehouse Architecture?
Dipankar Mazumdar
,
Cloudera
,
,
,
Metadata as the Control Plane: The Foundation of an AI-Native Data Platform
Junping (JD) Du
,
Datastrato
,
,
,
The Physics of LLM Inference at Scale
Suman Debnath
,
Anyscale
,
,
,
Managing Data at Exabyte Scale for AI Model Training
Chang She
,
LanceDB
,
,
,
3:35PM
–
4:00PM
PST
Open Data using Onehouse Cloud
If you've ever tried to build a data lakehouse, you know it's no small task. You've got to tie...
Chandra Krishnan
,
Onehouse
Register Now
Secure your spot at the premier data practitioner event! Don’t miss out on expert insights, hands-on workshops, and networking opportunities.
Apply to speak to our next edition
Are you a data practitioner with real-world lessons from designing, building, or operating the open data stack? We’d love to hear from you.
Themes we’re excited about:
- AI-native data platforms
- Data engineering for AI
- Cost and performance optimization at scale
- Maximizing openness and interoperability in your data stack



























