Shiming Xiang
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9CC-VQA: Conflict- and Correlation-Aware Method for Mitigating Knowledge Conflict in Knowledge-Based Visual Question Answering
arXiv 2026
Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization
arXiv 2025
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
arXiv 2025
IF-Bench: Benchmarking and Enhancing MLLMs for Infrared Images with Generative Visual Prompting
arXiv 2025
UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation
arXiv 2025
Continuous Speculative Decoding for Autoregressive Image Generation
arXiv 2024
AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation
arXiv 2024
Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation
CVPR 2024 1
Expanding Language-Image Pretrained Models for General Video Recognition
arXiv 2022
Affiliations
Frequent co-authors
10from 9 papers