0

Wenbo Su

Papers
16

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
16papers

Authored papers

16

Complementary Reinforcement Learning

arXiv 2026

2026

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling

arXiv 2026

2026

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library

arXiv 2025

2025

A Comprehensive Survey on Long Context Language Modeling

arXiv 2025

2025

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

arXiv 2025

2025

ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models

arXiv 2025

2025

UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering

arXiv 2025

2025

ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph

arXiv 2025

2025

Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation

arXiv 2025

2025

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

arXiv 2025

2025

"See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models

arXiv 2025

2025

Think-J: Learning to Think for Generative LLM-as-a-Judge

arXiv 2025

2025

USB: A Comprehensive and Unified Safety Evaluation Benchmark for Multimodal Large Language Models

arXiv 2025

2025

ProgCo: Program Helps Self-Correction of Large Language Models

arXiv 2025

2025

AIR: Complex Instruction Generation via Automatic Iterative Refinement

arXiv 2025

2025

MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models

arXiv 2024

2024

Affiliations

No known affiliations.

Frequent co-authors

10

from 16 papers