Fuxiao Liu

Papers: 12

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

12papers

Authored papers

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

arXiv 2026

2026

Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline

arXiv 2026

2026

Self-Rewarding Vision-Language Model via Reasoning Decomposition

arXiv 2025

2025

First Frame Is the Place to Go for Video Content Customization

arXiv 2025

2025

A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges

arXiv 2025

2025

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

arXiv 2025

2025

Mosaic-IT: Free Compositional Data Augmentation Improves Instruction Tuning

arXiv 2024

2024

DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents

arXiv 2023

2023

MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning

arXiv 2023

2023

HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models

CVPR 2024 1

2023

Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

arXiv 2023

2023

Visual News: Benchmark and Challenges in News Image Captioning

EMNLP 2021 11

2020

Affiliations

No known affiliations.

Frequent co-authors

from 12 papers

Zongxia Li

Xiyang Wu

Tianyi Zhou

Yaser Yacoob

Chengsong Huang

Dong Yu

Guangyao Shi

Hongyang Du

Ming Li

Yijun Liang