Weihao Yu
- Papers
- 18
Cite
Notes
Only stored in your browser.
Authored papers
18Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms
arXiv 2026
NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors
arXiv 2026
Seed1.5-VL Technical Report
arXiv 2025
MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models
CVPR 2025 1
Artificial Hippocampus Networks for Efficient Long-Context Modeling
arXiv 2025
Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward
arXiv 2025
X$^{2}$-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction
ICCV 2025
GeoT: Geometry-guided Instance-dependent Transition Matrix for Semi-supervised Tooth Point Cloud Segmentation
arXiv 2025
MambaOut: Do We Really Need Mamba for Vision?
CVPR 2025 1
LinFusion: 1 GPU, 1 Minute, 16K Image
arXiv 2024
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
arXiv 2024
KAN or MLP: A Fairer Comparison
arXiv 2024
Attention Prompting on Image for Large Vision-Language Models
arXiv 2024
Inception Transformer
arXiv 2022
Mugs: A Multi-Granular Self-Supervised Learning Framework
arXiv 2022
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
ICCV 2021 10
ConvBERT: Improving BERT with Span-based Dynamic Convolution
NeurIPS 2020 12
ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning
ICLR 2020 1
Affiliations
Frequent co-authors
10from 18 papers