Yue Fan
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery
arXiv 2026
SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration
arXiv 2026
Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices
CVPR 2025 1
GRIT: Teaching MLLMs to Think with Images
arXiv 2025
Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models
arXiv 2025
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
arXiv 2024
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding
arXiv 2024
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
arXiv 2024
LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models
arXiv 2023
USB: A Unified Semi-supervised Learning Benchmark for Classification
arXiv 2022
Affiliations
Frequent co-authors
10from 10 papers