Yonggan Fu
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
arXiv 2025
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
arXiv 2025
Fast-dLLM v2: Efficient Block-Diffusion LLM
arXiv 2025
LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement
arXiv 2025
Hymba: A Hybrid-head Architecture for Small Language Models
arXiv 2024
MG-Verilog: Multi-grained Dataset Towards Enhanced LLM-assisted Verilog Generation
arXiv 2024
AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment
arXiv 2024
Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration
arXiv 2024
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields
arXiv 2024
NeRFool: Uncovering the Vulnerability of Generalizable Neural Radiance Fields against Adversarial Perturbations
arXiv 2023
Hint-Aug: Drawing Hints from Foundation Vision Transformers Towards Boosted Few-Shot Parameter-Efficient Tuning
CVPR 2023 1
Affiliations
Frequent co-authors
10from 11 papers