Shuo Chen

Papers: 16

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

16papers

Authored papers

Context Forcing: Consistent Autoregressive Video Generation with Long Context

arXiv 2026

2026

METok: Multi-Stage Event-based Token Compression for Efficient Long Video Understanding

arXiv 2025

2025

GroundAct: Can LLM Agents Ground Actions in Environmental States?

arXiv 2025

2025

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

arXiv 2024

2024

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

arXiv 2024

2024

Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion

arXiv 2024

2024

Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering

arXiv 2024

2024

Stop Reasoning! When Multimodal LLM with Chain-of-Thought Reasoning Meets Adversarial Image

arXiv 2024

2024

Multimodal Pragmatic Jailbreak on Text-to-image Models

arXiv 2024

2024

PromptKD: Unsupervised Prompt Distillation for Vision-Language Models

CVPR 2024 1

2024

Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?

arXiv 2024

2024

Creative Birds: Self-Supervised Single-View 3D Style Transfer

ICCV 2023 1

2023

A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models

arXiv 2023

2023

PVO: Panoptic Visual Odometry

CVPR 2023 1

2022

IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis

ICCV 2023 1

2022

Contrastive Embedding for Generalized Zero-Shot Learning

CVPR 2021 1

2021

Affiliations

No known affiliations.

Frequent co-authors

from 16 papers

Volker Tresp

Jindong Gu

Philip Torr

Jian Yang

Zhen Han

Bailan He

Gengyuan Zhang

Guofeng Zhang

Hujun Bao

Weicai Ye