Jun Song
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models
arXiv 2026
Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration
arXiv 2025
VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions
arXiv 2025
DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning
arXiv 2025
"See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models
arXiv 2025
Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields
arXiv 2025
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
CVPR 2025 1
Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback
arXiv 2024
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models
arXiv 2024
Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching
arXiv 2024
LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating
arXiv 2024
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer
arXiv 2024
Affiliations
Frequent co-authors
10from 12 papers