Yiming Liu

Cite

Notes

Only stored in your browser.

Attribution

4papers

Authored papers

LongCLI-Bench: A Preliminary Benchmark and Study for Long-horizon Agentic Programming in Command-Line Interfaces

arXiv 2026

Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation

CVPR 2025 1

Benchmarking Complex Instruction-Following with Multiple Constraints Composition

arXiv 2024

LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models

arXiv 2024

No known affiliations.

from 4 papers

Hongning Wang

Jie Tang

engineer

Minlie Huang

Xiaotao Gu

Alejandro Lozano

Anjiang Wei

Binxin Hu

Bosi Wen

Chenyu Wang

Chuanhao Li