Wenkai Yang
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
arXiv 2026
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
arXiv 2026
AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents
arXiv 2026
Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning
arXiv 2025
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
arXiv 2025
DeepCritic: Deliberate Critique with Large Language Models
arXiv 2025
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents
arXiv 2024
Exploring Backdoor Vulnerabilities of Chat Models
arXiv 2024
Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization
arXiv 2024
Distilling Rule-based Knowledge into Large Language Models
arXiv 2023
Towards Codable Watermarking for Injecting Multi-bits Information to LLMs
arXiv 2023
Well-classified Examples are Underestimated in Classification with Deep Neural Networks
arXiv 2021
Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models
NAACL 2021 4
RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models
EMNLP 2021 11
Affiliations
Frequent co-authors
10from 14 papers