Fuxiao Liu
- Papers: 12
Authored papers
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data
arXiv 2026
Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline
arXiv 2026
Self-Rewarding Vision-Language Model via Reasoning Decomposition
arXiv 2025
First Frame Is the Place to Go for Video Content Customization
arXiv 2025
ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness
arXiv 2025
A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges
arXiv 2025
Mosaic-IT: Free Compositional Data Augmentation Improves Instruction Tuning
arXiv 2024
MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning
arXiv 2023
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models
CVPR 2024
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
arXiv 2023
DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents
arXiv 2023
Visual News: Benchmark and Challenges in News Image Captioning
EMNLP 2021
Frequent co-authors
10 from 12 papers