Xiaoyu Liu
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting
arXiv 2025
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs
arXiv 2025
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
arXiv 2025
AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
arXiv 2024
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions
arXiv 2024
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences
arXiv 2024
Beyond Image Borders: Learning Feature Extrapolation for Unbounded Image Composition
ICCV 2023 1
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models
CVPR 2024 1
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization
arXiv 2023
Explore Spurious Correlations at the Concept Level in Language Models for Text Classification
arXiv 2023
Affiliations
Frequent co-authors
10from 10 papers