Bin Qin
8 papers
Authored papers
LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence
arXiv 2026
OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence
arXiv 2026
DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset
arXiv 2026
MCGA: A Multi-task Classical Chinese Literary Genre Audio Corpus
arXiv 2026
OmniJigsaw: Enhancing Omni-Modal Reasoning via Modality-Orchestrated Reordering
arXiv 2026
LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training
arXiv 2025
BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent
arXiv 2025
Partial FC: Training 10 Million Identities on a Single Machine
arXiv 2020
Affiliations
No known affiliations.
Frequent co-authors
10 from 8 papers