Xuan Shen
- Papers: 11
Authored papers
DraftAttention: Fast Video Diffusion via Low-Resolution Attention Guidance
arXiv 2025
Efficient Reasoning with Hidden Thinking
arXiv 2025
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge
Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation
arXiv 2025
Fully Open Source Moxin-7B Technical Report
arXiv 2024
LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers
arXiv 2024
Search for Efficient Large Language Models
arXiv 2024
EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge
arXiv 2024
Rethinking Token Reduction for State Space Models
arXiv 2024
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network
CVPR 2023
Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge
arXiv 2023
Frequent co-authors
10 from 11 papers