Cite
Notes
Only stored in your browser.
Attribution
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories
arXiv 2026
The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning
arXiv 2025
Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization
arXiv 2024
from 3 papers
Yu Meng
Xinyu Zhu
Zhepei Wei
Chao-Wei Huang
Chengsong Huang
Danqi Chen
professor
Jiaxin Huang
Mengzhou Xia
Teng-Yun Hsiao
Yu-Chao Huang