Junxian Guo
7 papers
Authored papers
XAttention: Block Sparse Attention with Antidiagonal Scoring
arXiv 2025
LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention
arXiv 2025
Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter
arXiv 2025
Optimizing Mixture of Block Attention
arXiv 2025
SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity
arXiv 2025
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
arXiv 2024
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 7 papers