Hanchi Sun
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU
arXiv 2026
Expert Threshold Routing for Autoregressive Language Modeling with Dynamic Computation Allocation and Load Balancing
arXiv 2026
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
arXiv 2024
Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers