Jiarui Yao

Papers: 7

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

7papers

Authored papers

AgentSPEX: An Agent SPecification and EXecution Language

arXiv 2026

2026

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

arXiv 2026

2026

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

arXiv 2025

2025

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

arXiv 2025

2025

Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models

arXiv 2025

2025

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

arXiv 2025

2025

Rethinking Diverse Human Preference Learning through Principal Component Analysis

arXiv 2025

2025

Affiliations

No known affiliations.

Frequent co-authors

from 7 papers

Tong Zhang

Rui Pan

Ruida Wang

Hanze Dong

Nan Jiang

Wei Xiong

Bo Pang

Caiming Xiong

researcher

1 shared paper

Chunyuan Deng

1 shared paper

Doyen Sahoo

1 shared paper