0

Yanzhi Wang

Papers
20

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
20papers

Authored papers

20

A Very Big Video Reasoning Suite

arXiv 2026

2026

DraftAttention: Fast Video Diffusion via Low-Resolution Attention Guidance

arXiv 2025

2025

VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting

arXiv 2025

2025

LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation

arXiv 2025

2025

Efficient Reasoning with Hidden Thinking

arXiv 2025

2025

Taming Diffusion for Dataset Distillation with High Representativeness

arXiv 2025

2025

QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge

quartdepth-post-training-quantization-for

2025

Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation

arXiv 2025

2025

Fully Open Source Moxin-7B Technical Report

arXiv 2024

2024

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers

arXiv 2024

2024

Rethinking Token Reduction for State Space Models

arXiv 2024

2024

Fast and Memory-Efficient Video Diffusion Using Streamlined Inference

arXiv 2024

2024

Search for Efficient Large Language Models

arXiv 2024

2024

EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge

arXiv 2024

2024

DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network

CVPR 2023 1

2023

Can Adversarial Examples Be Parsed to Reveal Victim Model Information?

arXiv 2023

2023

Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge

arXiv 2023

2023

SparCL: Sparse Continual Learning on the Edge

arXiv 2022

2022

Advancing Model Pruning via Bi-level Optimization

arXiv 2022

2022

Pruning Adversarially Robust Neural Networks without Adversarial Examples

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 20 papers