Cite
Notes
Only stored in your browser.
Attribution
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition
arXiv 2024
Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions
On Surgical Fine-tuning for Language Encoders
arXiv 2023
from 3 papers
Abhilasha Lodha
Ahmed Salem
Chenglei Si
Chenhao Li
Daniel Paleka
David Jurgens
Diyi Yang
Dmitrii Petrov
Dragos Albastroiu
Edoardo Debenedetti