Yiming Liu
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4LongCLI-Bench: A Preliminary Benchmark and Study for Long-horizon Agentic Programming in Command-Line Interfaces
arXiv 2026
Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation
CVPR 2025 1
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
arXiv 2024
LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers