Pengjun Xie
- Papers
- 42
Cite
Notes
Only stored in your browser.
Authored papers
42Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking
arXiv 2026
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking
arXiv 2026
Rethinking Composed Image Retrieval Evaluation: A Fine-Grained Benchmark from Image Editing
arXiv 2026
LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval
arXiv 2026
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models
arXiv 2025
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
arXiv 2025
VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning
arXiv 2025
ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents
arXiv 2025
MASKSEARCH: A Universal Pre-Training Framework to Enhance Agentic Search Capability
arXiv 2025
WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization
arXiv 2025
Scaling Agents via Continual Pre-training
arXiv 2025
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent
arXiv 2025
E^2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker
arXiv 2025
Agentic Knowledgeable Self-awareness
arXiv 2025
IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction
arXiv 2025
Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics
arXiv 2025
LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing
arXiv 2025
Qwen3Guard Technical Report
arXiv 2025
WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research
arXiv 2025
WebSailor: Navigating Super-human Reasoning for Web Agent
arXiv 2025
ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
arXiv 2025
WebDancer: Towards Autonomous Information Seeking Agency
arXiv 2025
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning
arXiv 2025
Scaling Generalist Data-Analytic Agents
arXiv 2025
AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis
arXiv 2025
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
arXiv 2025
Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization
arXiv 2025
Towards Text-Image Interleaved Retrieval
arXiv 2025
SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement
arXiv 2025
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
arXiv 2024
Agent Planning with World Knowledge Model
arXiv 2024
Benchmarking Agentic Workflow Generation
arXiv 2024
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
arXiv 2024
CoFE-RAG: A Comprehensive Full-chain Evaluation Framework for Retrieval-Augmented Generation with Enhanced Data Diversity
arXiv 2024
Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts
arXiv 2024
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding
arXiv 2023
A Two-Stage Adaptation of Large Language Models for Text Ranking
arXiv 2023
Language Models are Universal Embedders
arXiv 2023
Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval
arXiv 2022
Robust Self-Augmentation for Named Entity Recognition with Meta Reweighting
NAACL 2022 7
AISHELL-NER: Named Entity Recognition from Chinese Speech
arXiv 2022
Few-NERD: A Few-Shot Named Entity Recognition Dataset
ACL 2021 5
Affiliations
Frequent co-authors
10from 42 papers