Beomseok Kang

Attribution

4 papers

Authored papers

CompactAttention: Accelerating Chunked Prefill with Block-Union KV Selection

arXiv 2026

Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection

arXiv 2026

QWHA: Quantization-Aware Walsh-Hadamard Adaptation for Parameter-Efficient Fine-Tuning on Large Language Models

arXiv 2025

LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning

arXiv 2025

No known affiliations.

Co-authors (from 4 papers)

Jae-Joon Kim

Jiwon Song

Dongwon Jo

Hyesung Jeon

Seojune Lee

Yulhwa Kim