Minki Kang
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11THINKSAFE: Self-Generated Safety Alignment for Reasoning Models
arXiv 2026
Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents
arXiv 2026
Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR
arXiv 2026
Distilling LLM Agent into Small Models with Retrieval and Code Tools
arXiv 2025
Rethinking Reward Models for Multi-Domain Test-Time Scaling
arXiv 2025
SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models
arXiv 2025
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
arXiv 2024
Latent Paraphrasing: Perturbation on Layers Improves Knowledge Injection in Language Models
arXiv 2024
Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks
knowledge-augmented-reasoning-distillation
Knowledge-Augmented Language Model Verification
arXiv 2023
Edge Representation Learning with Hypergraphs
NeurIPS 2021 12
Affiliations
Frequent co-authors
10from 11 papers