Zaid Khan
PhD student at UNC Chapel Hill working on multimodal LLMs, agent evaluation, and reasoning benchmarks.
- Role
- grad-student
- Currently at
- University of North Carolina at Chapel Hill
- twitter.com/codezakh
- GitHub
- github.com/codezakh
- Scholar
- scholar.google.com/citations
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind
arXiv 2026
Cog-DRIFT: Exploration on Adaptively Reformulated Instances Enables Learning from Hard Reasoning Problems
arXiv 2026
Open Thoughts: Curating Reasoning Datasets for Open-Source R1 Replications
blog
OpenThoughts: Data Recipes for Reasoning Models
arXiv 2025
PRInTS: Reward Modeling for Long-Horizon Information Seeking
arXiv 2025
Learning to Generate Unit Tests for Automated Debugging
arXiv 2025
One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration
arXiv 2025
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
arXiv 2024
Affiliations
Frequent co-authors
10from 8 papers
Mohit Bansal
Elias Stengel-Eskin
Archiki Prasad
Justin Chih-Yao Chen
Ashima Suvarna
grad-student
Etash Guha
researcher
Georgios Smyrnis
grad-student
Hritik Bansal
grad-student
Jaemin Cho
Jean Mercat
researcher