Zaid Khan

PhD student at UNC Chapel Hill working on multimodal LLMs, agent evaluation, and reasoning benchmarks.

Role: grad-student
Currently at: University of North Carolina at Chapel Hill
Twitter: twitter.com/codezakh
GitHub: github.com/codezakh
Scholar: scholar.google.com/citations
Papers: 8

Attribution

Affiliations & profile: scholar.google.com/citations

Authored papers

8

Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind

arXiv 2026

Cog-DRIFT: Exploration on Adaptively Reformulated Instances Enables Learning from Hard Reasoning Problems

arXiv 2026

Open Thoughts: Curating Reasoning Datasets for Open-Source R1 Replications

blog

OpenThoughts: Data Recipes for Reasoning Models

arXiv 2025

PRInTS: Reward Modeling for Long-Horizon Information Seeking

arXiv 2025

Learning to Generate Unit Tests for Automated Debugging

arXiv 2025

One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration

arXiv 2025

DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback

arXiv 2024

Affiliations

Currently at

University of North Carolina at Chapel Hill

grad-student · university lab

Frequent co-authors

10

from 8 papers

Mohit Bansal

7 shared papers

Elias Stengel-Eskin

6 shared papers

Archiki Prasad

4 shared papers

Justin Chih-Yao Chen

3 shared papers

Ashima Suvarna

grad-student

2 shared papers

Etash Guha

researcher

2 shared papers

Georgios Smyrnis

grad-student

2 shared papers

Hritik Bansal

grad-student

2 shared papers

Jaemin Cho

2 shared papers

Jean Mercat

researcher

2 shared papers