Yi Zeng
- Papers: 16
Authored papers
EDU-CIRCUIT-HW: Evaluating Multimodal Large Language Models on Real-World University-Level STEM Student Handwritten Solutions
arXiv 2026
PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks
arXiv 2025
Incorporating brain-inspired mechanisms for multimodal learning in artificial intelligence
arXiv 2025
STEP: A Unified Spiking Transformer Evaluation Platform for Fair and Reproducible Benchmarking
arXiv 2025
Enhancing Audio-Visual Spiking Neural Networks through Semantic-Alignment and Cross-Modal Residual Learning
arXiv 2025
CVC: A Large-Scale Chinese Value Rule Corpus for Value Alignment of Large Language Models
arXiv 2025
Introducing v0.5 of the AI Safety Benchmark from MLCommons
arXiv 2024
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs
arXiv 2024
RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content
arXiv 2024
BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models
arXiv 2024
RedCode: Risky Code Execution and Generation Benchmark for Code Agents
arXiv 2024
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors
arXiv 2024
CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event Localization
arXiv 2024
Revisiting Data-Free Knowledge Distillation with Poisoned Teachers
arXiv 2023
Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
arXiv 2023
An Efficient Knowledge Transfer Strategy for Spiking Neural Networks from Static to Event Domain
arXiv 2023
Frequent co-authors
10 from 16 papers