Sanmi Koyejo

Assistant professor of computer science at Stanford; works on trustworthy ML and rigorous LLM evaluation; co-author of "Are Emergent Abilities of Large Language Models a Mirage?".

Role
professor
Papers
23

Authored papers

23

DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents

arXiv 2026

2026

CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?

arXiv 2026

2026

Latent Adversarial Regularization for Offline Preference Optimization

arXiv 2026

2026

The Leaderboard Illusion

preprint

2025

One-Minute Video Generation with Test-Time Training

CVPR 2025

2025

Structured Prompting Enables More Robust Evaluation of Language Models

arXiv 2025

2025

AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning

arXiv 2025

2025

UQ: Assessing Language Models on Unsolved Questions

arXiv 2025

2025

Fairness through Difference Awareness: Measuring Desired Group Discrimination in LLMs

arXiv 2025

2025

Fantastic Bugs and Where to Find Them in AI Benchmarks

arXiv 2025

2025

Reliable and Efficient Amortized Model-based Evaluation

arXiv 2025

2025

Putnam-AXIOM: A Functional and Static Benchmark for Measuring Higher Level Mathematical Reasoning in LLMs

arXiv 2025

2025

End-to-End Test-Time Training for Long Context

arXiv 2025

2025

Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models

arXiv 2025

2025

Pantograph: A Machine-to-Machine Interaction Interface for Advanced Theorem Proving, High Level Reasoning, and Data Extraction in Lean 4

arXiv 2024

2024

Best-of-N Jailbreaking

arXiv 2024

2024

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

arXiv 2024

2024

Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHRs

arXiv 2024

2024

Crossing Linguistic Horizons: Finetuning and Comprehensive Evaluation of Vietnamese Large Language Models

arXiv 2024

2024

Representation Engineering: A Top-Down Approach to AI Transparency

arXiv 2023

2023

Learning to (Learn at Test Time)

arXiv 2023

2023

HiFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance

arXiv 2023

2023

Principled Federated Domain Adaptation: Gradient Projection and Auto-Weighting

arXiv 2023

2023

Affiliations

Currently at

Stanford University

professor · university lab

Frequent co-authors

10

from 23 papers