Zhaopeng Tu
- Papers
- 34
Cite
Notes
Only stored in your browser.
Authored papers
34SkillNet: Create, Evaluate, and Connect AI Skills
arXiv 2026
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
arXiv 2025
Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms
arXiv 2025
BatonVoice: An Operationalist Framework for Enhancing Controllable Speech Synthesis with Linguistic Intelligence from LLMs
arXiv 2025
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains
arXiv 2025
DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning
arXiv 2025
SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning
arXiv 2025
Don't Get Lost in the Trees: Streamlining LLM Reasoning by Overcoming Tree Search Exploration Pitfalls
arXiv 2025
RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents
arXiv 2025
Deep Research: A Systematic Survey
arXiv 2025
Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards
arXiv 2025
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability
arXiv 2024
Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
arXiv 2024
Benchmarking LLMs via Uncertainty Quantification
arXiv 2024
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
arXiv 2024
CoAct: A Global-Local Hierarchy for Autonomous Agent Collaboration
arXiv 2024
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training
arXiv 2024
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates
arXiv 2024
Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding
arXiv 2024
Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models
arXiv 2024
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
arXiv 2023
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration
arXiv 2023
Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench
arXiv 2023
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate
arXiv 2023
Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine
arXiv 2023
Exploring Human-Like Translation Strategy with Large Language Models
arXiv 2023
All Languages Matter: On the Multilingual Safety of Large Language Models
arXiv 2023
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models
arXiv 2023
Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation
ACL 2022 5
Adapters for Enhanced Modeling of Multilingual Knowledge and Text
arXiv 2022
On the Copying Behaviors of Pre-Training for Neural Machine Translation
Findings (ACL) 2021 8
Rejuvenating Low-Frequency Words: Making the Most of Parallel Data in Non-Autoregressive Translation
ACL 2021 5
On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation
Findings (EMNLP) 2021 11
Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning
understanding-and-improving-encoder-layer
Affiliations
Frequent co-authors
10from 34 papers