Tony Lee
Stanford CRFM researcher and engineer; HELM benchmark co-lead.
- Role
- researcher
- Unknown
- GitHub
- github.com/teetone
- Scholar
- scholar.google.com/citations
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6AHELM: A Holistic Evaluation of Audio-Language Models
arXiv 2025
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
arXiv 2024
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text
arXiv 2024
Relevance Filtering for Embedding-based Retrieval
arXiv 2024
Holistic Evaluation of Language Models
TMLR
WILDS: A Benchmark of in-the-Wild Distribution Shifts
arXiv 2020
Affiliations
Frequent co-authors
10from 6 papers