Jianyu Wei
5 papers
Authored papers
MiMo-V2-Flash Technical Report
arXiv 2026
Vec-LUT: Vector Table Lookup for Parallel Ultra-Low-Bit LLM Inference on Edge Devices
arXiv 2025
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge
arXiv 2024
AFPQ: Asymmetric Floating Point Quantization for LLMs
arXiv 2023
Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers