0

Bowen Yu

Papers
34

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
34papers

Authored papers

34

HLE-Verified: A Systematic Verification and Structured Revision of Humanity's Last Exam

arXiv 2026

2026

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models

arXiv 2026

2026

Qwen-Image Technical Report

arXiv 2025

2025

Qwen3-Omni Technical Report

arXiv 2025

2025

Qwen3 Technical Report

preprint

2025

Qwen3-VL Technical Report

arXiv 2025

2025

WorldPM: Scaling Human Preference Modeling

arXiv 2025

2025

Soft Adaptive Policy Optimization

arXiv 2025

2025

Qwen3Guard Technical Report

arXiv 2025

2025

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

arXiv 2025

2025

START: Self-taught Reasoner with Tools

arXiv 2025

2025

MARGE: Improving Math Reasoning for LLMs with Guided Exploration

arXiv 2025

2025

Qwen2.5 Technical Report

arXiv 2024

2024

Qwen2 Technical Report

arXiv 2024

2024

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

arXiv 2024

2024

Aligning Large Language Models via Self-Steering Optimization

arXiv 2024

2024

Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment

arXiv 2024

2024

Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model

arXiv 2024

2024

SoFA: Shielded On-the-fly Alignment via Priority Rule Following

arXiv 2024

2024

Transferable Post-training via Inverse Value Learning

arXiv 2024

2024

Language Models can Self-Lengthen to Generate Long Texts

arXiv 2024

2024

Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment

arXiv 2024

2024

Towards a Unified View of Preference Learning for Large Language Models: A Survey

arXiv 2024

2024

ProcessBench: Identifying Process Errors in Mathematical Reasoning

arXiv 2024

2024

Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment

arXiv 2024

2024

Self-Retrieval: End-to-End Information Retrieval with One Large Language Model

arXiv 2024

2024

Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement

arXiv 2024

2024

Rethinking Data Selection at Scale: Random Selection is Almost All You Need

arXiv 2024

2024

Qwen Technical Report

arXiv 2023

2023

Preference Ranking Optimization for Human Alignment

arXiv 2023

2023

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch

arXiv 2023

2023

Improving Question Generation with Multi-level Content Planning

arXiv 2023

2023

GAMMA: Revisiting Template-based Automated Program Repair via Mask Prediction

arXiv 2023

2023

TPLinker: Single-stage Joint Extraction of Entities and Relations Through Token Pair Linking

COLING 2020 8

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 34 papers