0

wei he

Papers
20

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
20papers

Authored papers

20

Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation

arXiv 2025

2025

Better Process Supervision with Bi-directional Rewarding Signals

arXiv 2025

2025

ROOT: Robust Orthogonalized Optimizer for Neural Network Training

arXiv 2025

2025

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

arXiv 2025

2025

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

arXiv 2025

2025

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

arXiv 2025

2025

Agentic Learner with Grow-and-Refine Multimodal Semantic Memory

arXiv 2025

2025

MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning

arXiv 2025

2025

MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration

arXiv 2025

2025

Distill Visual Chart Reasoning Ability from LLMs to MLLMs

arXiv 2024

2024

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

arXiv 2024

2024

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

arXiv 2024

2024

LongHeads: Multi-Head Attention is Secretly a Long Context Processor

arXiv 2024

2024

Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs

arXiv 2024

2024

CHOICE: Benchmarking the Remote Sensing Capabilities of Large Vision-Language Models

arXiv 2024

2024

Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models

arXiv 2024

2024

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling

arXiv 2024

2024

LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration

arXiv 2024

2024

Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism

gold-yolo-efficient-object-detector-via

2023

The Rise and Potential of Large Language Model Based Agents: A Survey

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

10

from 20 papers