0

Guoyin Wang

Papers
21

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
21papers

Authored papers

21

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

arXiv 2026

2026

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

arXiv 2026

2026

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

arXiv 2026

2026

FaceID-6M: A Large-Scale, Open-Source FaceID Customization Dataset

arXiv 2025

2025

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

arXiv 2025

2025

Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding

arXiv 2025

2025

Turn That Frown Upside Down: FaceID Customization via Cross-Training Data

arXiv 2025

2025

OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use

arXiv 2025

2025

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

arXiv 2025

2025

FullStack Bench: Evaluating LLMs as Full Stack Coders

arXiv 2024

2024

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks

arXiv 2024

2024

PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image Understanding

arXiv 2024

2024

The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism

arXiv 2024

2024

Empowering Large Language Model Agents through Action Learning

arXiv 2024

2024

Yi: Open Foundation Models by 01.AI

arXiv 2024

2024

Aria: An Open Multimodal Native Mixture-of-Experts Model

arXiv 2024

2024

Reinforcement Learning Enhanced LLMs: A Survey

arXiv 2024

2024

Are Human-generated Demonstrations Necessary for In-context Learning?

arXiv 2023

2023

Towards Building the Federated GPT: Federated Instruction Tuning

arXiv 2023

2023

Instruction Tuning for Large Language Models: A Survey

arXiv 2023

2023

POINTER: Constrained Progressive Text Generation via Insertion-based Generative Pre-training

EMNLP 2020 11

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 21 papers