0

Pengjun Xie

Papers
42

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
42papers

Authored papers

42

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

arXiv 2026

2026

ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking

arXiv 2026

2026

Rethinking Composed Image Retrieval Evaluation: A Fine-Grained Benchmark from Image Editing

arXiv 2026

2026

LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval

arXiv 2026

2026

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

arXiv 2025

2025

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

arXiv 2025

2025

VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning

arXiv 2025

2025

ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

arXiv 2025

2025

MASKSEARCH: A Universal Pre-Training Framework to Enhance Agentic Search Capability

arXiv 2025

2025

WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization

arXiv 2025

2025

Scaling Agents via Continual Pre-training

arXiv 2025

2025

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

arXiv 2025

2025

E^2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

arXiv 2025

2025

Agentic Knowledgeable Self-awareness

arXiv 2025

2025

IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction

arXiv 2025

2025

Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics

arXiv 2025

2025

LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing

arXiv 2025

2025

Qwen3Guard Technical Report

arXiv 2025

2025

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

arXiv 2025

2025

WebSailor: Navigating Super-human Reasoning for Web Agent

arXiv 2025

2025

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

arXiv 2025

2025

WebDancer: Towards Autonomous Information Seeking Agency

arXiv 2025

2025

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

arXiv 2025

2025

Scaling Generalist Data-Analytic Agents

arXiv 2025

2025

AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis

arXiv 2025

2025

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

arXiv 2025

2025

Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization

arXiv 2025

2025

Towards Text-Image Interleaved Retrieval

arXiv 2025

2025

SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement

arXiv 2025

2025

Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

arXiv 2024

2024

Agent Planning with World Knowledge Model

arXiv 2024

2024

Benchmarking Agentic Workflow Generation

arXiv 2024

2024

mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval

arXiv 2024

2024

CoFE-RAG: A Comprehensive Full-chain Evaluation Framework for Retrieval-Augmented Generation with Enhanced Data Diversity

arXiv 2024

2024

Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts

arXiv 2024

2024

SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding

arXiv 2023

2023

A Two-Stage Adaptation of Large Language Models for Text Ranking

arXiv 2023

2023

Language Models are Universal Embedders

arXiv 2023

2023

Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval

arXiv 2022

2022

Robust Self-Augmentation for Named Entity Recognition with Meta Reweighting

NAACL 2022 7

2022

AISHELL-NER: Named Entity Recognition from Chinese Speech

arXiv 2022

2022

Few-NERD: A Few-Shot Named Entity Recognition Dataset

ACL 2021 5

2021

Affiliations

No known affiliations.

Frequent co-authors

10

from 42 papers