Tiansheng Huang
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search
arXiv 2025
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation
arXiv 2025
Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable
arXiv 2025
Multi-Agent Reinforcement Learning with Focal Diversity Optimization
arXiv 2025
Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey
arXiv 2024
A Survey on Large Language Model-Based Game Agents
arXiv 2024
Vaccine: Perturbation-aware Alignment for Large Language Models against Harmful Fine-tuning Attack
arXiv 2024
Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation
arXiv 2024
Antidote: Post-fine-tuning Safety Alignment for Large Language Models against Harmful Fine-tuning
arXiv 2024
Lisa: Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning Attack
arXiv 2024
PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models
arXiv 2024
Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives
arXiv 2023
FedSpeed: Larger Local Interval, Less Communication Round, and Higher Generalization Accuracy
arXiv 2023
Affiliations
Frequent co-authors
10from 13 papers