Bolin Ding
- Papers: 19
Authored papers
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization
arXiv 2026
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation
arXiv 2026
AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications
arXiv 2025
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models
arXiv 2025
Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution
arXiv 2025
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
arXiv 2025
Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends
arXiv 2025
RePO: ReLU-based Preference Optimization
arXiv 2025
Incentivizing Reasoning from Weak Supervision
arXiv 2025
CuES: A Curiosity-driven and Environment-grounded Synthesis Framework for Agentic RL
arXiv 2025
Very Large-Scale Multi-Agent Simulation in AgentScope
arXiv 2024
EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models
arXiv 2024
$β$-DPO: Direct Preference Optimization with Dynamic $β$
arXiv 2024
Exploring Selective Layer Fine-Tuning in Federated Learning
arXiv 2024
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
arXiv 2024
Data-Juicer: A One-Stop Data Processing System for Large Language Models
arXiv 2023
Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation
arXiv 2023
Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes
arXiv 2023
CARD: Channel Aligned Robust Blend Transformer for Time Series Forecasting
arXiv 2023
Frequent co-authors
10 from 19 papers