Zeyuan Chen

Papers: 13

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

13papers

Authored papers

X-Dyna: Expressive Dynamic Human Image Animation

CVPR 2025 1

2025

VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents

arXiv 2025

2025

GTA1: GUI Test-time Scaling Agent

arXiv 2025

2025

UNIDOC-BENCH: A Unified Benchmark for Document-Centric Multimodal RAG

arXiv 2025

2025

CoDA: Coding LM via Diffusion Adaptation

arXiv 2025

2025

SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant

arXiv 2024

2024

ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models

arXiv 2024

2024

"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models

arXiv 2023

2023

BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions

arXiv 2023

2023

BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents

arXiv 2023

2023

MGTBench: Benchmarking Machine-Generated Text Detection

arXiv 2023

2023

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization

arXiv 2023

2023

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

from 13 papers

ran Xu

9 shared papers

Caiming Xiong

researcher

8 shared papers

Juan Carlos Niebles

4 shared papers

Silvio Savarese

researcher

Can Qin

Huan Wang

JianGuo Zhang

Le Xue

Shelby Heinecke

Weiran Yao