Ahmed F AbouElhamayed
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5SplitReason: Learning To Offload Reasoning
arXiv 2025
SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUs
arXiv 2025
TokenButler: Token Importance is Predictable
arXiv 2025
ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models
arXiv 2024
BitMoD: Bit-serial Mixture-of-Datatype LLM Acceleration
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers