Huazhong Yang
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
arXiv 2025
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
arXiv 2025
Evaluating Quantized Large Language Models
arXiv 2024
MBQ: Modality-Balanced Quantization for Large Vision-Language Models
CVPR 2025 1
FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models
ICCV 2025
Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study
arXiv 2024
MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression
arXiv 2024
Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better
arXiv 2024
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
arXiv 2024
Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation
arXiv 2023
Affiliations
Frequent co-authors
10from 10 papers