0

Pei Ke

Papers
18

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
18papers

Authored papers

18

IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation

arXiv 2026

2026

From Theft to Bomb-Making: The Ripple Effect of Unlearning in Defending Against Jailbreak Attacks

arXiv 2024

2024

CharacterBench: Benchmarking Character Customization of Large Language Models

arXiv 2024

2024

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

arXiv 2024

2024

Benchmarking Complex Instruction-Following with Multiple Constraints Composition

arXiv 2024

2024

Towards Efficient Exact Optimization of Language Model Alignment

arXiv 2024

2024

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

arXiv 2024

2024

Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning

arXiv 2023

2023

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization

arXiv 2023

2023

Unveiling the Implicit Toxicity in Large Language Models

arXiv 2023

2023

AlignBench: Benchmarking Chinese Alignment of Large Language Models

arXiv 2023

2023

Black-Box Prompt Optimization: Aligning Large Language Models without Model Training

arXiv 2023

2023

CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation

arXiv 2023

2023

Directed Acyclic Transformer Pre-training for High-quality Non-autoregressive Text Generation

arXiv 2023

2023

EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training

arXiv 2022

2022

Rethinking and Refining the Distinct Metric

ACL 2022 5

2022

A Large-Scale Chinese Short-Text Conversation Dataset

arXiv 2020

2020

CPM: A Large-scale Generative Chinese Pre-trained Language Model

arXiv 2020

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 18 papers