Jian Luan
- Papers
- 19
Cite
Notes
Only stored in your browser.
Authored papers
19Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously
arXiv 2026
Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension
arXiv 2026
Mobile GUI Agents under Real-world Threats: Are We There Yet?
arXiv 2026
MiDashengLM: Efficient Audio Understanding with General Audio Captions
arXiv 2025
GLAP: General contrastive audio-text pretraining across domains and languages
arXiv 2025
TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization
arXiv 2025
Direction-Aware Diagonal Autoregressive Image Generation
arXiv 2025
Xiaomi MiMo-VL-Miloco Technical Report
arXiv 2025
TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding
arXiv 2025
BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent
arXiv 2025
Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering
arXiv 2025
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study
arXiv 2025
KG-Retriever: Efficient Knowledge Indexing for Retrieval-Augmented Large Language Models
arXiv 2024
A Comprehensive Evaluation of Quantization Strategies for Large Language Models
arXiv 2024
Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
arXiv 2024
MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding
arXiv 2024
SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM
arXiv 2024
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
arXiv 2024
ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback
arXiv 2024
Affiliations
Frequent co-authors
10from 19 papers