Yuzhang Shang
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16TIDE: Efficient and Lossless MoE Diffusion LLM Inference with I/O-aware Expert Offload
arXiv 2026
Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation
arXiv 2026
V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval
arXiv 2026
VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
arXiv 2025
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
arXiv 2025
AdaTooler-V: Adaptive Tool-Use for Images and Videos
arXiv 2025
Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis
arXiv 2025
Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark
arXiv 2025
PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models
arXiv 2025
DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation
arXiv 2025
LLM Inference Unveiled: Survey and Roofline Model Insights
arXiv 2024
Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning
ICCV 2025
QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning
ICCV 2025
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
arXiv 2024
PB-LLM: Partially Binarized Large Language Models
arXiv 2023
Post-training Quantization on Diffusion Models
CVPR 2023 1
Affiliations
Frequent co-authors
10from 16 papers