Matei Zaharia
Co-founder and CTO of Databricks; UC Berkeley professor; creator of Apache Spark; co-author of MLflow and DSPy.
- Role
- founder
- Currently at
- Databricks
- twitter.com/matei_zaharia
- Scholar
- scholar.google.com/citations
- Papers
- 18
Cite
Notes
Only stored in your browser.
Authored papers
18OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning
arXiv 2026
DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis
arXiv 2025
Why Do Multi-Agent LLM Systems Fail?
arXiv 2025
EXP-Bench: Can AI Conduct AI Research Experiments?
arXiv 2025
WARP: An Efficient Engine for Multi-Vector Retrieval
arXiv 2025
Adaptive Semantic Prompt Caching with VectorQ
arXiv 2025
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
arXiv 2025
Optimizing Model Selection for Compound AI Systems
arXiv 2025
Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs
arXiv 2025
World Model on Million-Length Video And Language With Blockwise RingAttention
arXiv 2024
Semantic Operators: A Declarative Model for Rich, AI-based Data Processing
arXiv 2024
Text2SQL is Not Enough: Unifying AI and Databases with TAG
arXiv 2024
Ring Attention with Blockwise Transformers for Near-Infinite Context
arXiv 2023
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems
arXiv 2023
How is ChatGPT's behavior changing over time?
arXiv 2023
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
arXiv 2022
PLAID: An Efficient Engine for Late Interaction Retrieval
arXiv 2022
MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
arXiv 2022
Affiliations
Frequent co-authors
10from 18 papers
Christopher Potts
Omar Khattab
Ion Stoica
professor / co-founder
Carlos Guestrin
Dan Klein
Joseph E. Gonzalez
Lakshya A. Agrawal
Liana Patel
Arnav Singhvi
Dilara Soylu
grad-student