Cite
Notes
Only stored in your browser.
Attribution
Teaching Language Models to Critique via Reinforcement Learning
arXiv 2025
FullStack Bench: Evaluating LLMs as Full Stack Coders
arXiv 2024
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
from 3 papers
Boyi Liu
Jie Chen
Jingjing Xu
Tao Sun
Yingxiang Yang
Yongfei Liu
Yufeng Zhang
Aoyan Li
Bo Li
Bowen Li