0

Yaliang Li

Papers
17

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
17papers

Authored papers

17

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

arXiv 2026

2026

TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents

arXiv 2026

2026

R^3L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification

arXiv 2026

2026

AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications

arXiv 2025

2025

DetailMaster: Can Your Text-to-Image Model Handle Long Prompts?

arXiv 2025

2025

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

arXiv 2025

2025

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

arXiv 2025

2025

Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends

arXiv 2025

2025

Very Large-Scale Multi-Agent Simulation in AgentScope

arXiv 2024

2024

ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction

arXiv 2024

2024

EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models

arXiv 2024

2024

ChartThinker: A Contextual Chain-of-Thought Approach to Optimized Chart Summarization

arXiv 2024

2024

Exploring Selective Layer Fine-Tuning in Federated Learning

arXiv 2024

2024

Data-Juicer: A One-Stop Data Processing System for Large Language Models

arXiv 2023

2023

Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation

arXiv 2023

2023

Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes

arXiv 2023

2023

TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for Time Series

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

10

from 17 papers