Yuansheng Ni
Researcher at TIGER-Lab (University of Waterloo); co-author of the MMLU-Pro and MEGA-Bench benchmarks.
- Role: researcher
- Currently at: TIGER-Lab
- Unknown
- GitHub: Unknown
- Scholar: scholar.google.com/scholar
- Papers: 7
Authored papers
PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
arXiv 2025
VisCoder2: Building Multi-Language Visualization Coding Agents
arXiv 2025
VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation
arXiv 2025
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
NeurIPS 2024
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
arXiv 2024
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models
arXiv 2024
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
CVPR 2024
Frequent co-authors
10 from 7 papers