Qixiang Ye
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding
arXiv 2026
Balancing Understanding and Generation in Discrete Diffusion Models
arXiv 2026
Thinking with Images via Self-Calling Agent
arXiv 2025
Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models
arXiv 2025
Geometric-Mean Policy Optimization
arXiv 2025
YOLOv12: Attention-Centric Real-Time Object Detectors
arXiv 2025
VMamba: Visual State Space Model
arXiv 2024
vHeat: Building Vision Models upon Heat Conduction
arXiv 2024
Evaluation of Text-to-Video Generation Models: A Dynamics Perspective
arXiv 2024
ClawMachine: Learning to Fetch Visual Tokens for Referential Comprehension
arXiv 2024
Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection
arXiv 2024
DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution
arXiv 2024
Generative Prompt Model for Weakly Supervised Object Localization
ICCV 2023 1
BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
arXiv 2022
Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration
CVPR 2023 1
Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
ICCV 2023 1
Affiliations
Frequent co-authors
10from 16 papers