Jiancan Wu

Papers: 7

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

7papers

Authored papers

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

arXiv 2026

2026

RePO: ReLU-based Preference Optimization

arXiv 2025

2025

Quantile Advantage Estimation for Entropy-Safe Reasoning

arXiv 2025

2025

Robust Preference Optimization via Dynamic Target Margins

arXiv 2025

2025

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization

arXiv 2024

2024

$β$-DPO: Direct Preference Optimization with Dynamic $β$

arXiv 2024

2024

MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 7 papers

Xiang Wang

Junkang Wu

Xiangnan He

Bolin Ding

Jinyang Gao

Kexin Huang

Xue Wang

Yuexiang Xie

Zhengyi Yang

An Zhang