Chak Tou Leong
- Papers: 8
8 papers
Authored papers
TokenSkip: Controllable Chain-of-Thought Compression in LLMs
arXiv 2025
SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution
arXiv 2025
Evaluating Parameter Efficient Methods for RLVR
arXiv 2025
Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?
arXiv 2025
STeCa: Step-level Trajectory Calibration for LLM Agent Learning
arXiv 2025
Direct Preference Optimization Using Sparse Feature-Level Constraints
arXiv 2024
Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue
arXiv 2024
Self-Detoxifying Language Models via Toxification Reversal
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 8 papers