Jun Yu
- Papers: 13
Authored papers (13)
BayesRAG: Probabilistic Mutual Evidence Corroboration for Multimodal Retrieval-Augmented Generation
arXiv 2026
Beyond Unimodal Shortcuts: MLLMs as Cross-Modal Reasoners for Grounded Named Entity Recognition
arXiv 2026
Video-ToC: Video Tree-of-Cue Reasoning
arXiv 2026
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces
arXiv 2025
A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone
arXiv 2025
Towards Text-Image Interleaved Retrieval
arXiv 2025
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model
arXiv 2025
Imp: Highly Capable Large Multimodal Models for Mobile Devices
arXiv 2024
BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks
arXiv 2023
The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation
arXiv 2023
Prophet: Prompting Large Language Models with Complementary Answer Heuristics for Knowledge-based Visual Question Answering
CVPR 2023
Graph Matching with Bi-level Noisy Correspondence
ICCV 2023
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering
arXiv 2019
Affiliations
Frequent co-authors
10 from 13 papers