Qingyun Li
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent
arXiv 2026
ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
arXiv 2025
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents
arXiv 2025
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model
arXiv 2025
A Simple Aerial Detection Baseline of Multimodal Language Models
arXiv 2025
A Simple Aerial Detection Baseline of Multimodal Language Models
arXiv 2025
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
CVPR 2025 1
Co-Training Vision Language Models for Remote Sensing Multi-task Learning
arXiv 2025
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World
arXiv 2024
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
arXiv 2024
GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding
arXiv 2024
FLoRA: Low-Rank Core Space for N-dimension
arXiv 2024
Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision
CVPR 2024 1
ARS-DETR: Aspect Ratio-Sensitive Detection Transformer for Aerial Oriented Object Detection
arXiv 2023
PointOBB: Learning Oriented Object Detection via Single Point Supervision
CVPR 2024 1
Affiliations
Frequent co-authors
10from 15 papers