Xinlei Yu
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Memento-Skills: Let Agents Design Agents
arXiv 2026
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
arXiv 2026
4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding
arXiv 2026
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models
arXiv 2026
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models
arXiv 2025
Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
arXiv 2025
CRISP-SAM2: SAM2 with Cross-Modal Interaction and Semantic Prompting for Multi-Organ Segmentation
arXiv 2025
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views
arXiv 2025
Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow
arXiv 2025
Visual Document Understanding and Question Answering: A Multi-Agent Collaboration Framework with Test-Time Scaling
arXiv 2025
Affiliations
Frequent co-authors
10from 10 papers