Xiaowen Chu
- Papers: 15
Authored papers
MDN: Parallelizing Stepwise Momentum for Delta Linear Attention
arXiv 2026
ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference
arXiv 2025
FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts Models
arXiv 2025
Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression
arXiv 2025
Model Quantization and Hardware Acceleration for Vision Transformers: A Comprehensive Survey
arXiv 2024
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation
arXiv 2024
Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models
arXiv 2024
Should We Really Edit Language Models? On the Evaluation of Edited Language Models
arXiv 2024
BurstGPT: A Real-world Workload Dataset to Optimize LLM Serving Systems
arXiv 2024
LongGenBench: Long-context Generation Benchmark
arXiv 2024
NAS-LID: Efficient Neural Architecture Search with Local Intrinsic Dimension
arXiv 2022
Evolutionary Multi-objective Architecture Search Framework: Application to COVID-19 3D CT Classification
arXiv 2021
Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans
arXiv 2021
EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs
arXiv 2021
AutoML: A Survey of the State-of-the-Art
arXiv 2019
Frequent co-authors
10 from 15 papers