Weihao Yu

Papers: 18

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

18papers

Authored papers

Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms

arXiv 2026

2026

NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors

arXiv 2026

2026

Seed1.5-VL Technical Report

arXiv 2025

2025

MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models

CVPR 2025 1

2025

X$^{2}$-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction

ICCV 2025

2025

Artificial Hippocampus Networks for Efficient Long-Context Modeling

arXiv 2025

2025

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

arXiv 2025

2025

GeoT: Geometry-guided Instance-dependent Transition Matrix for Semi-supervised Tooth Point Cloud Segmentation

arXiv 2025

2025

MambaOut: Do We Really Need Mamba for Vision?

CVPR 2025 1

2024

LinFusion: 1 GPU, 1 Minute, 16K Image

arXiv 2024

2024

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

arXiv 2024

2024

KAN or MLP: A Fairer Comparison

arXiv 2024

2024

Attention Prompting on Image for Large Vision-Language Models

arXiv 2024

2024

Inception Transformer

arXiv 2022

2022

Mugs: A Multi-Granular Self-Supervised Learning Framework

arXiv 2022

2022

Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

ICCV 2021 10

2021

ConvBERT: Improving BERT with Span-based Dynamic Convolution

NeurIPS 2020 12

2020

ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning

ICLR 2020 1

2020

Affiliations

No known affiliations.

Frequent co-authors

from 18 papers

Xinchao Wang

Jiashi Feng

Runpeng Yu

Shuicheng Yan

Chenxin Li

Yixuan Yuan

Chenyang Si

Li Yuan

Lingfeng Ren

Pan Zhou