Wenkai Yang

Papers: 14

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

14papers

Authored papers

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

arXiv 2026

2026

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

arXiv 2026

2026

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

arXiv 2026

2026

Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning

arXiv 2025

2025

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

arXiv 2025

2025

DeepCritic: Deliberate Critique with Large Language Models

arXiv 2025

2025

Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents

arXiv 2024

2024

Exploring Backdoor Vulnerabilities of Chat Models

arXiv 2024

2024

Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization

arXiv 2024

2024

Towards Codable Watermarking for Injecting Multi-bits Information to LLMs

arXiv 2023

2023

Distilling Rule-based Knowledge into Large Language Models

arXiv 2023

2023

RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models

EMNLP 2021 11

2021

Well-classified Examples are Underestimated in Classification with Deep Neural Networks

arXiv 2021

2021

Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models

NAACL 2021 4

2021

Affiliations

No known affiliations.

Frequent co-authors

from 14 papers

Yankai Lin

Xu sun

Jie zhou

Ji-Rong Wen

Jingwen Chen

Lei LI

Ruobing Xie

Saiyong Yang

Weijie Liu

Xuancheng Ren