Ruotian Ma
6 papers
Authored papers
RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents
arXiv 2025
S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
arXiv 2025
BatonVoice: An Operationalist Framework for Enhancing Controllable Speech Synthesis with Linguistic Intelligence from LLMs
arXiv 2025
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains
arXiv 2025
SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning
arXiv 2025
Are Large Language Models Good Prompt Optimizers?
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 6 papers