Haoyu Lu
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15Attention Residuals
arXiv 2026
Towards Pixel-Level VLM Perception via Simple Points Prediction
arXiv 2026
Kimi K2.5: Visual Agentic Intelligence
arXiv 2026
BabyVision: Visual Reasoning Beyond Language
arXiv 2026
WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models
arXiv 2026
Kimi-VL Technical Report
arXiv 2025
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization
arXiv 2025
Kimi k1.5: Scaling Reinforcement Learning with LLMs
arXiv 2025
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
arXiv 2024
DeepSeek-VL: Towards Real-World Vision-Language Understanding
arXiv 2024
Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs
arXiv 2024
Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining
arXiv 2024
Towards Event-oriented Long Video Understanding
arXiv 2024
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
arXiv 2023
UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling
arXiv 2023
Affiliations
Frequent co-authors
10from 15 papers