Junxiao Yang

Papers: 8

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints

arXiv 2025

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

arXiv 2025

Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!

arXiv 2025

BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs

arXiv 2025

How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study

arXiv 2025

Agent-SafetyBench: Evaluating the Safety of LLM Agents

arXiv 2024

From Theft to Bomb-Making: The Ripple Effect of Unlearning in Defending Against Jailbreak Attacks

arXiv 2024

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization

arXiv 2023

Affiliations

No known affiliations.

Frequent co-authors

from 8 papers

Hongning Wang

Minlie Huang

Zhexin Zhang

Shiyao Cui

Chujie Zheng

Fei Mi

Pei Ke

Yida Lu

Beibei Wang

Bernard Ghanem