Guoyin Wang
- Papers: 21
Authored papers (21)
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning
arXiv 2026
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization
arXiv 2026
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation
arXiv 2026
FaceID-6M: A Large-Scale, Open-Source FaceID Customization Dataset
arXiv 2025
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values
arXiv 2025
Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding
arXiv 2025
Turn That Frown Upside Down: FaceID Customization via Cross-Training Data
arXiv 2025
OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
arXiv 2025
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
arXiv 2025
FullStack Bench: Evaluating LLMs as Full Stack Coders
arXiv 2024
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
arXiv 2024
PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image Understanding
arXiv 2024
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
arXiv 2024
Empowering Large Language Model Agents through Action Learning
arXiv 2024
Yi: Open Foundation Models by 01.AI
arXiv 2024
Aria: An Open Multimodal Native Mixture-of-Experts Model
arXiv 2024
Reinforcement Learning Enhanced LLMs: A Survey
arXiv 2024
Are Human-generated Demonstrations Necessary for In-context Learning?
arXiv 2023
Towards Building the Federated GPT: Federated Instruction Tuning
arXiv 2023
Instruction Tuning for Large Language Models: A Survey
arXiv 2023
POINTER: Constrained Progressive Text Generation via Insertion-based Generative Pre-training
EMNLP 2020