Jianwei Yin
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification
arXiv 2026
MM-CondChain: A Programmatically Verified Benchmark for Visually Grounded Deep Compositional Reasoning
arXiv 2026
CollabEdit: Towards Non-destructive Collaborative Knowledge Editing
arXiv 2024
GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent
arXiv 2024
ImageRAG: Enhancing Ultra High Resolution Remote Sensing Imagery Analysis with ImageRAG
arXiv 2024
ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
arXiv 2024
RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback
arXiv 2024
Tool-Planner: Task Planning with Clusters across Multiple Tools
arXiv 2024
ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis
arXiv 2024
Bridging Context Gaps: Leveraging Coreference Resolution for Long Contextual Understanding
arXiv 2024
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing
arXiv 2023
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection
arXiv 2023
Benchmarking Sequential Visual Input Reasoning and Prediction in Multimodal Large Language Models
arXiv 2023
VL-CheckList: Evaluating Pre-trained Vision-Language Models with Objects, Attributes and Relations
arXiv 2022
Affiliations
Frequent co-authors
10from 14 papers