Yaliang Li
- Papers
- 17
Cite
Notes
Only stored in your browser.
Authored papers
17On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
arXiv 2026
TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents
arXiv 2026
R^3L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification
arXiv 2026
AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications
arXiv 2025
DetailMaster: Can Your Text-to-Image Model Handle Long Prompts?
arXiv 2025
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models
arXiv 2025
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
arXiv 2025
Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends
arXiv 2025
Very Large-Scale Multi-Agent Simulation in AgentScope
arXiv 2024
ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction
arXiv 2024
EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models
arXiv 2024
ChartThinker: A Contextual Chain-of-Thought Approach to Optimized Chart Summarization
arXiv 2024
Exploring Selective Layer Fine-Tuning in Federated Learning
arXiv 2024
Data-Juicer: A One-Stop Data Processing System for Large Language Models
arXiv 2023
Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation
arXiv 2023
Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes
arXiv 2023
TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for Time Series
arXiv 2023
Affiliations
Frequent co-authors
10from 17 papers