0

Wei Huang

Papers
26

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
26papers

Authored papers

26

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

arXiv 2026

2026

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

arXiv 2026

2026

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

arXiv 2026

2026

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

arXiv 2026

2026

The Best of the Two Worlds: Harmonizing Semantic and Hash IDs for Sequential Recommendation

arXiv 2025

2026

Scaling RL to Long Videos

arXiv 2025

2025

MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO

arXiv 2025

2025

Scaling Diffusion Transformers Efficiently via $μ$P

arXiv 2025

2025

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

arXiv 2025

2025

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

arXiv 2025

2025

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

arXiv 2025

2025

EmbRACE-3K: Embodied Reasoning and Action in Complex Environments

arXiv 2025

2025

Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization

arXiv 2025

2025

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

CVPR 2025 1

2024

MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More

arXiv 2024

2024

SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models

arXiv 2024

2024

SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining

arXiv 2024

2024

Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models

arXiv 2024

2024

BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

arXiv 2024

2024

An empirical study of LLaMA3 quantization: from LLMs to MLLMs

arXiv 2024

2024

MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps

arXiv 2024

2024

On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability

arXiv 2024

2024

Auto-scaling Vision Transformers without Training

auto-scaling-vision-transformers-without

2022

SAFARI: Versatile and Efficient Evaluations for Robustness of Interpretability

ICCV 2023 1

2022

PD-GAN: Probabilistic Diverse GAN for Image Inpainting

CVPR 2021 1

2021

Rethinking Image Inpainting via a Mutual Encoder-Decoder with Feature Equalizations

ECCV 2020 8

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 26 papers