Zhaopeng Tu

Papers
34

Authored papers

34

SkillNet: Create, Evaluate, and Connect AI Skills

arXiv 2026

2026

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

arXiv 2025

2025

Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms

arXiv 2025

2025

BatonVoice: An Operationalist Framework for Enhancing Controllable Speech Synthesis with Linguistic Intelligence from LLMs

arXiv 2025

2025

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

arXiv 2025

2025

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

arXiv 2025

2025

SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning

arXiv 2025

2025

Don't Get Lost in the Trees: Streamlining LLM Reasoning by Overcoming Tree Search Exploration Pitfalls

arXiv 2025

2025

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

arXiv 2025

2025

Deep Research: A Systematic Survey

arXiv 2025

2025

Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards

arXiv 2025

2025

Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability

arXiv 2024

2024

Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models

arXiv 2024

2024

Benchmarking LLMs via Uncertainty Quantification

arXiv 2024

2024

How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments

arXiv 2024

2024

CoAct: A Global-Local Hierarchy for Autonomous Agent Collaboration

arXiv 2024

2024

Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

arXiv 2024

2024

NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates

arXiv 2024

2024

Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding

arXiv 2024

2024

Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models

arXiv 2024

2024

GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher

arXiv 2023

2023

Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration

arXiv 2023

2023

Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench

arXiv 2023

2023

Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate

arXiv 2023

2023

Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine

arXiv 2023

2023

Exploring Human-Like Translation Strategy with Large Language Models

arXiv 2023

2023

All Languages Matter: On the Multilingual Safety of Large Language Models

arXiv 2023

2023

Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models

arXiv 2023

2023

Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation

ACL 2022 5

2022

Adapters for Enhanced Modeling of Multilingual Knowledge and Text

arXiv 2022

2022

On the Copying Behaviors of Pre-Training for Neural Machine Translation

Findings (ACL) 2021 8

2021

Rejuvenating Low-Frequency Words: Making the Most of Parallel Data in Non-Autoregressive Translation

ACL 2021 5

2021

On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation

Findings (EMNLP) 2021 11

2021

Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 34 papers