Yubo Ma

5 papers

Authored papers

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

arXiv 2026

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

arXiv 2025

Long Context vs. RAG for LLMs: An Evaluation and Revisits

arXiv 2024

AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge

arXiv 2024

Improving Large Language Models in Event Relation Logical Prediction

arXiv 2023

No known affiliations.

Co-authors (from 5 papers)

Dahua Lin

Haodong Duan

Jiaqi Wang

Kai Chen

Shengyuan Ding

Yixin Cao

Yuhang Zang

Ziyu Liu

Aixin Sun

Anh Tuan Luu