Shusheng Yang

Papers: 8

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

VideoNSA: Native Sparse Attention Scales Video Understanding

arXiv 2025

2025

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

arXiv 2024

2024

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

CVPR 2025 1

2024

Qwen Technical Report

arXiv 2023

2023

Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond

arXiv 2023

2023

ViTMatte: Boosting Image Matting with Pretrained Plain Vision Transformers

arXiv 2023

2023

TouchStone: Evaluating Vision-Language Models by Language Models

arXiv 2023

2023

Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection

ICCV 2023 1

2022

Affiliations

No known affiliations.

Frequent co-authors

from 8 papers

Chang Zhou

3 shared papers

Jingren Zhou

3 shared papers

Jinze Bai

3 shared papers

Junyang Lin

Peng Wang

Shijie Wang

Shuai Bai

Xinggang Wang

Jihan Yang

Saining Xie