Shuo Chen
- Papers: 16
Authored papers (16)
Context Forcing: Consistent Autoregressive Video Generation with Long Context
arXiv 2026
GroundAct: Can LLM Agents Ground Actions in Environmental States?
arXiv 2025
METok: Multi-Stage Event-based Token Compression for Efficient Long Video Understanding
arXiv 2025
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
arXiv 2024
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
arXiv 2024
PromptKD: Unsupervised Prompt Distillation for Vision-Language Models
CVPR 2024 1
Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion
arXiv 2024
Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering
arXiv 2024
Stop Reasoning! When Multimodal LLM with Chain-of-Thought Reasoning Meets Adversarial Image
arXiv 2024
Multimodal Pragmatic Jailbreak on Text-to-image Models
arXiv 2024
Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?
arXiv 2024
A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models
arXiv 2023
Creative Birds: Self-Supervised Single-View 3D Style Transfer
ICCV 2023 1
PVO: Panoptic Visual Odometry
CVPR 2023 1
IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis
ICCV 2023 1
Contrastive Embedding for Generalized Zero-Shot Learning
CVPR 2021 1
Affiliations
Frequent co-authors
10 from 16 papers