Yanzhi Wang
- Papers
- 20
Cite
Notes
Only stored in your browser.
Authored papers
20A Very Big Video Reasoning Suite
arXiv 2026
DraftAttention: Fast Video Diffusion via Low-Resolution Attention Guidance
arXiv 2025
VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting
arXiv 2025
LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation
arXiv 2025
Efficient Reasoning with Hidden Thinking
arXiv 2025
Taming Diffusion for Dataset Distillation with High Representativeness
arXiv 2025
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge
quartdepth-post-training-quantization-for
Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation
arXiv 2025
Fully Open Source Moxin-7B Technical Report
arXiv 2024
LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers
arXiv 2024
Rethinking Token Reduction for State Space Models
arXiv 2024
Fast and Memory-Efficient Video Diffusion Using Streamlined Inference
arXiv 2024
Search for Efficient Large Language Models
arXiv 2024
EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge
arXiv 2024
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network
CVPR 2023 1
Can Adversarial Examples Be Parsed to Reveal Victim Model Information?
arXiv 2023
Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge
arXiv 2023
SparCL: Sparse Continual Learning on the Edge
arXiv 2022
Advancing Model Pruning via Bi-level Optimization
arXiv 2022
Pruning Adversarially Robust Neural Networks without Adversarial Examples
arXiv 2022
Affiliations
Frequent co-authors
10from 20 papers