0

Bolin Ding

Papers
19

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
19papers

Authored papers

19

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

arXiv 2026

2026

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

arXiv 2026

2026

AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications

arXiv 2025

2025

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

arXiv 2025

2025

Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution

arXiv 2025

2025

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

arXiv 2025

2025

Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends

arXiv 2025

2025

RePO: ReLU-based Preference Optimization

arXiv 2025

2025

Incentivizing Reasoning from Weak Supervision

arXiv 2025

2025

CuES: A Curiosity-driven and Environment-grounded Synthesis Framework for Agentic RL

arXiv 2025

2025

Very Large-Scale Multi-Agent Simulation in AgentScope

arXiv 2024

2024

EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models

arXiv 2024

2024

$β$-DPO: Direct Preference Optimization with Dynamic $β$

arXiv 2024

2024

Exploring Selective Layer Fine-Tuning in Federated Learning

arXiv 2024

2024

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization

arXiv 2024

2024

Data-Juicer: A One-Stop Data Processing System for Large Language Models

arXiv 2023

2023

Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation

arXiv 2023

2023

Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes

arXiv 2023

2023

CARD: Channel Aligned Robust Blend Transformer for Time Series Forecasting

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

10

from 19 papers