Qingyun Li

Papers: 15

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

15papers

Authored papers

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

arXiv 2026

2026

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

arXiv 2025

2025

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

arXiv 2025

2025

Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model

arXiv 2025

2025

A Simple Aerial Detection Baseline of Multimodal Language Models

arXiv 2025

2025

A Simple Aerial Detection Baseline of Multimodal Language Models

arXiv 2025

2025

EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering

CVPR 2025 1

2025

Co-Training Vision Language Models for Remote Sensing Multi-task Learning

arXiv 2025

2025

The All-Seeing Project V2: Towards General Relation Comprehension of the Open World

arXiv 2024

2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

arXiv 2024

2024

GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding

arXiv 2024

2024

FLoRA: Low-Rank Core Space for N-dimension

arXiv 2024

2024

Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision

CVPR 2024 1

2023

ARS-DETR: Aspect Ratio-Sensitive Detection Transformer for Aerial Oriented Object Detection

arXiv 2023

2023

PointOBB: Learning Oriented Object Detection via Single Point Supervision

CVPR 2024 1

2023

Affiliations

No known affiliations.

Frequent co-authors

from 15 papers

Xue Yang

Yu Qiao

Jifeng Dai

Yi Yu

Yushi Chen

Junchi Yan

Weiyun Wang

Wenhai Wang

Bowen Yang

JingJing Xie