Huizhuo Yuan

Papers: 6

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Tensor Product Attention Is All You Need

arXiv 2025

On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

arXiv 2025

Group Representational Position Encoding

arXiv 2025

MARS: Unleashing the Power of Variance Reduction for Training Large Models

arXiv 2024

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

arXiv 2024

Self-Play Preference Optimization for Language Model Alignment

arXiv 2024

Affiliations

No known affiliations.

Frequent co-authors

from 6 papers

Quanquan Gu

Yifeng Liu

Yang Yuan

Yifan Zhang

Andrew Chi-Chih Yao

Kaixuan Ji

Zhen Qin

Zixiang Chen

Andrew C Yao

Kangping Xu