Dawei Yin
- Papers: 36
Authored papers
Agentic-R: Learning to Retrieve for Agentic Search
arXiv 2026
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
arXiv 2026
MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching
arXiv 2026
Measuring Maximum Activations in Open Large Language Models
arXiv 2026
EndPrompt: Efficient Long-Context Extension via Terminal Anchoring
arXiv 2026
Reinforced Efficient Reasoning via Semantically Diverse Exploration
arXiv 2026
VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos
arXiv 2025
Retrieval Models Aren't Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models
arXiv 2025
COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence
arXiv 2025
Debiasing Multimodal Large Language Models via Noise-Aware Preference Optimization
CVPR 2025
MathReal: We Keep It Real! A Real Scene Benchmark for Evaluating Math Reasoning in Multimodal Large Language Models
arXiv 2025
Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
arXiv 2025
Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding
arXiv 2025
ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability
arXiv 2025
CurES: From Gradient Analysis to Efficient Curriculum Learning for Reasoning LLMs
arXiv 2025
Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning
arXiv 2025
Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models
arXiv 2024
ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator
arXiv 2024
HiGPT: Heterogeneous Graph Language Model
arXiv 2024
JAILJUDGE: A Comprehensive Jailbreak Judge Benchmark with Multi-Agent Enhanced Explanation Evaluation Framework
arXiv 2024
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
arXiv 2024
Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning
arXiv 2024
MAIR: A Massive Benchmark for Evaluating Instructed Retrieval
arXiv 2024
PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization
arXiv 2024
Cross-model Control: Improving Multiple Large Language Models in One-time Training
arXiv 2024
InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct
arXiv 2024
The Butterfly Effect of Model Editing: Few Edits Can Trigger Large Language Models Collapse
arXiv 2024
G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models
arXiv 2024
Learning to Use Tools via Cooperative and Interactive Agents
arXiv 2024
Representation Learning with Large Language Models for Recommendation
arXiv 2023
GraphGPT: Graph Instruction Tuning for Large Language Models
arXiv 2023
Disentangled Contrastive Collaborative Filtering
arXiv 2023
LLMRec: Large Language Models with Graph Augmentation for Recommendation
arXiv 2023
MILL: Mutual Verification with Large Language Models for Zero-Shot Query Expansion
arXiv 2023
Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers
arXiv 2023
A Large Scale Search Dataset for Unbiased Learning to Rank
arXiv 2022
Frequent co-authors
10 from 36 papers