Wenbo Su
- Papers: 16
Authored papers (16)
Complementary Reinforcement Learning
arXiv 2026
One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling
arXiv 2026
Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library
arXiv 2025
A Comprehensive Survey on Long Context Language Modeling
arXiv 2025
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
arXiv 2025
ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models
arXiv 2025
UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering
arXiv 2025
ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph
arXiv 2025
Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation
arXiv 2025
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
arXiv 2025
"See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models
arXiv 2025
Think-J: Learning to Think for Generative LLM-as-a-Judge
arXiv 2025
USB: A Comprehensive and Unified Safety Evaluation Benchmark for Multimodal Large Language Models
arXiv 2025
ProgCo: Program Helps Self-Correction of Large Language Models
arXiv 2025
AIR: Complex Instruction Generation via Automatic Iterative Refinement
arXiv 2025
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models
arXiv 2024
Affiliations
Frequent co-authors
10 from 16 papers