Minlie Huang
- Papers
- 87
Cite
Notes
Only stored in your browser.
Authored papers
87GLM-5: from Vibe Coding to Agentic Engineering
arXiv 2026
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
arXiv 2026
The Side Effects of Being Smart: Safety Risks in MLLMs' Multi-Image Reasoning
arXiv 2026
IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation
arXiv 2026
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
arXiv 2025
Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
arXiv 2025
SocialEval: Evaluating Social Intelligence of Large Language Models
arXiv 2025
Crisp: Cognitive Restructuring of Negative Thoughts through Multi-turn Supportive Dialogues
arXiv 2025
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
arXiv 2025
Human Decision-making is Susceptible to AI-driven Manipulation
arXiv 2025
Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!
arXiv 2025
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization
ICCV 2025
Data-Efficient RLVR via Off-Policy Influence Guidance
arXiv 2025
Trust-Region Adaptive Policy Optimization
arXiv 2025
BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs
arXiv 2025
LongSafety: Evaluating Long-Context Safety of Large Language Models
arXiv 2025
How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study
arXiv 2025
EmoBench: Evaluating the Emotional Intelligence of Large Language Models
arXiv 2024
On Prompt-Driven Safeguarding for Large Language Models
arXiv 2024
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
arXiv 2024
Agent-SafetyBench: Evaluating the Safety of LLM Agents
arXiv 2024
Weak-to-Strong Extrapolation Expedites Alignment
arXiv 2024
MiniPLM: Knowledge Distillation for Pre-Training Language Models
arXiv 2024
Towards Efficient Exact Optimization of Language Model Alignment
arXiv 2024
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
arXiv 2024
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
arXiv 2024
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
arXiv 2024
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
arXiv 2024
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models
arXiv 2024
AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback
arXiv 2024
From Theft to Bomb-Making: The Ripple Effect of Unlearning in Defending Against Jailbreak Attacks
arXiv 2024
LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
arXiv 2024
Language Models Learn to Mislead Humans via RLHF
arXiv 2024
CharacterBench: Benchmarking Character Customization of Large Language Models
arXiv 2024
Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning
arXiv 2024
Seeker: Enhancing Exception Handling in Code with LLM-based Multi-Agent Approach
arXiv 2024
Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework
arXiv 2024
SS-GEN: A Social Story Generation Framework with Large Language Models
arXiv 2024
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
arXiv 2023
Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning
arXiv 2023
Pre-Training to Learn in Context
arXiv 2023
Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond
arXiv 2023
CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
arXiv 2023
AgentBench: Evaluating LLMs as Agents
arXiv 2023
Safety Assessment of Chinese Large Language Models
arXiv 2023
AlignBench: Benchmarking Chinese Alignment of Large Language Models
arXiv 2023
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
arXiv 2023
SafetyBench: Evaluating the Safety of Large Language Models
arXiv 2023
CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation
arXiv 2023
Directed Acyclic Transformer Pre-training for High-quality Non-autoregressive Text Generation
arXiv 2023
Large Language Models Are Not Robust Multiple Choice Selectors
arXiv 2023
Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
arXiv 2023
Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation
arXiv 2023
Unveiling the Implicit Toxicity in Large Language Models
arXiv 2023
Re$^3$Dial: Retrieve, Reorganize and Rescale Dialogue Corpus for Long-Turn Open-Domain Dialogue Pre-training
arXiv 2023
PAL: Persona-Augmented Emotional Support Conversation Generation
arXiv 2022
Rethinking and Refining the Distinct Metric
ACL 2022 5
COLD: A Benchmark for Chinese Offensive Language Detection
arXiv 2022
EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training
arXiv 2022
CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI
arXiv 2022
ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format
arXiv 2022
AugESC: Dialogue Augmentation with Large Language Models for Emotional Support Conversation
arXiv 2022
CASE: Aligning Coarse-to-Fine Cognition and Affection for Empathetic Response Generation
arXiv 2022
Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation
arXiv 2022
CDConv: A Benchmark for Contradiction Detection in Chinese Conversations
arXiv 2022
AutoCAD: Automatically Generating Counterfactuals for Mitigating Shortcut Learning
arXiv 2022
LaMemo: Language Modeling with Look-Ahead Memory
NAACL 2022 7
MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions
arXiv 2022
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation
arXiv 2021
CEM: Commonsense-aware Empathetic Response Generation
arXiv 2021
Towards Emotional Support Dialog Systems
ACL 2021 5
PsyQA: A Chinese Dataset for Generating Long Counseling Text for Mental Health Support
Findings (ACL) 2021 8
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
ACL 2021 5
CoMAE: A Multi-factor Hierarchical Framework for Empathetic Response Generation
Findings (ACL) 2021 8
DiscoDVT: Generating Long Text with Discourse-Aware Discrete Variational Transformer
EMNLP 2021 11
On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark
Findings (ACL) 2022 5
HyKnow: End-to-End Task-Oriented Dialog Modeling with Hybrid Knowledge Management
Findings (ACL) 2021 8
Transferable Persona-Grounded Dialogues via Grounded Minimal Edits
EMNLP 2021 11
KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation
kdconv-a-chinese-multi-domain-dialogue-1
A Large-Scale Chinese Short-Text Conversation Dataset
arXiv 2020
CPM: A Large-scale Generative Chinese Pre-trained Language Model
arXiv 2020
CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset
crosswoz-a-large-scale-chinese-cross-domain-1
Robustness Testing of Language Understanding in Task-Oriented Dialog
ACL 2021 5
CR-Walker: Tree-Structured Graph Reasoning and Dialog Acts for Conversational Recommendation
EMNLP 2021 11
A Self-Training Method for Machine Reading Comprehension with Soft Evidence Extraction
a-self-training-method-for-machine-reading-1
Difference-aware Knowledge Selection for Knowledge-grounded Conversation Generation
Findings of the Association for Computational Linguistics 2020
Emotional Chatting Machine: Emotional Conversation Generation with Internal and External Memory
arXiv 2017
Affiliations
Frequent co-authors
10from 87 papers