Yushi Yang
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4Robust LLM Unlearning with MUDMAN: Meta-Unlearning with Disruption Masking And Normalization
arXiv 2025
Beyond Toxic Neurons: A Mechanistic Analysis of DPO for Toxicity Reduction
arXiv 2024
Can sparse autoencoders be used to decompose and interpret steering vectors?
arXiv 2024
Evaluating Fine-Tuning Efficiency of Human-Inspired Learning Strategies in Medical Question Answering
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
7from 4 papers