Lei Sha
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
arXiv 2025
How Far are LLMs from Being Our Digital Twins? A Benchmark for Persona-Based Behavior Chain Simulation
arXiv 2025
Benchmarking Multi-Step Legal Reasoning and Analyzing Chain-of-Thought Effects in Large Language Models
arXiv 2025
Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues
arXiv 2024
ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator
arXiv 2024
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
arXiv 2024
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
arXiv 2024
Towards a Unified View of Preference Learning for Large Language Models: A Survey
arXiv 2024
Bird-Eye Transformers for Text Generation Models
arXiv 2022
Affiliations
Frequent co-authors
10from 9 papers