Pei Ke
- Papers: 18
Authored papers
IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation
arXiv 2026
From Theft to Bomb-Making: The Ripple Effect of Unlearning in Defending Against Jailbreak Attacks
arXiv 2024
CharacterBench: Benchmarking Character Customization of Large Language Models
arXiv 2024
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
arXiv 2024
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
arXiv 2024
Towards Efficient Exact Optimization of Language Model Alignment
arXiv 2024
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models
arXiv 2024
Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning
arXiv 2023
Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
arXiv 2023
Unveiling the Implicit Toxicity in Large Language Models
arXiv 2023
AlignBench: Benchmarking Chinese Alignment of Large Language Models
arXiv 2023
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
arXiv 2023
CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation
arXiv 2023
Directed Acyclic Transformer Pre-training for High-quality Non-autoregressive Text Generation
arXiv 2023
EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training
arXiv 2022
Rethinking and Refining the Distinct Metric
ACL 2022
A Large-Scale Chinese Short-Text Conversation Dataset
arXiv 2020
CPM: A Large-scale Generative Chinese Pre-trained Language Model
arXiv 2020
Frequent co-authors
10 from 18 papers