Benfeng Xu
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13DeepResearch Bench II: Diagnosing Deep Research Agents via Rubrics from Expert Report
arXiv 2026
A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces
arXiv 2026
FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents
arXiv 2026
WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora
arXiv 2026
Wiki Live Challenge: Challenging Deep Research Agents with Expert-Level Wikipedia Articles
arXiv 2026
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
arXiv 2025
From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding
arXiv 2025
Rationales Are Not Silver Bullets: Measuring the Impact of Rationales on Model Performance and Reliability
arXiv 2025
Benchmarking Large Language Models on Controllable Generation under Diversified Instructions
arXiv 2024
Qwen Technical Report
arXiv 2023
ExpertPrompting: Instructing Large Language Models to be Distinguished Experts
arXiv 2023
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning
arXiv 2023
Building Chinese Biomedical Language Models via Multi-Level Text Discrimination
arXiv 2021
Affiliations
Frequent co-authors
10from 13 papers