Xilin Chen
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models
ICCV 2025
BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation
arXiv 2025
Jodi: Unification of Visual Generation and Understanding via Joint Modeling
arXiv 2025
un$^2$CLIP: Improving CLIP's Visual Detail Capturing Ability via Inverting unCLIP
arXiv 2025
DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks
arXiv 2025
Assimilation Matters: Model-level Backdoor Detection in Vision-Language Pretrained Models
arXiv 2025
M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models
arXiv 2024
T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models
arXiv 2024
Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs
arXiv 2024
CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation
arXiv 2024
HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention
CVPR 2024 1
UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
CVPR 2025 1
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework
arXiv 2022
Synchronous Bidirectional Learning for Multilingual Lip Reading
arXiv 2020
Segmentation Transformer: Object-Contextual Representations for Semantic Segmentation
ECCV 2020 8
Affiliations
Frequent co-authors
10from 15 papers