Yiyuan Li
5 papers
Authored papers
ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces
arXiv 2026
One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling
arXiv 2026
HLE-Verified: A Systematic Verification and Structured Revision of Humanity's Last Exam
arXiv 2026
FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models
arXiv 2024
Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers