Yida Lu
Papers: 9
Authored papers
The Side Effects of Being Smart: Safety Risks in MLLMs' Multi-Image Reasoning
arXiv 2026
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
arXiv 2025
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization
ICCV 2025
LongSafety: Evaluating Long-Context Safety of Large Language Models
arXiv 2025
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
arXiv 2024
Agent-SafetyBench: Evaluating the Safety of LLM Agents
arXiv 2024
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
arXiv 2024
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models
arXiv 2024
From Theft to Bomb-Making: The Ripple Effect of Unlearning in Defending Against Jailbreak Attacks
arXiv 2024
Frequent co-authors
10 from 9 papers