Haoran You
- Papers
- 6
Cite
Notes
Only stored in your browser.
Authored papers
6ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
arXiv 2024
When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
arXiv 2024
EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting
arXiv 2024
ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
shiftaddvit-mixture-of-multiplication
SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning
arXiv 2022
Max-Affine Spline Insights Into Deep Network Pruning
max-affine-spline-insights-into-deep-network
Affiliations
Frequent co-authors
10from 6 papers