Ruihao Gong
- Papers: 10
Authored papers
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
arXiv 2026
Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing
arXiv 2026
Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention
arXiv 2026
InCoder-32B: Code Foundation Model for Industrial Scenarios
arXiv 2026
Pre$^3$: Enabling Deterministic Pushdown Automata for Faster Structured LLM Generation
arXiv 2025
HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration
arXiv 2024
Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
arXiv 2023
TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models
CVPR 2024 1
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models
arXiv 2023
Lossy and Lossless (L$^2$) Post-training Model Size Compression
arXiv 2023