Wei He
- Papers: 20
Authored papers (20)
Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation
arXiv 2025
Better Process Supervision with Bi-directional Rewarding Signals
arXiv 2025
ROOT: Robust Orthogonalized Optimizer for Neural Network Training
arXiv 2025
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
arXiv 2025
R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
arXiv 2025
VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
arXiv 2025
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
arXiv 2025
MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning
arXiv 2025
MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration
arXiv 2025
Distill Visual Chart Reasoning Ability from LLMs to MLLMs
arXiv 2024
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
arXiv 2024
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
arXiv 2024
LongHeads: Multi-Head Attention is Secretly a Long Context Processor
arXiv 2024
Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs
arXiv 2024
CHOICE: Benchmarking the Remote Sensing Capabilities of Large Vision-Language Models
arXiv 2024
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
arXiv 2024
Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling
arXiv 2024
LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration
arXiv 2024
Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism
The Rise and Potential of Large Language Model Based Agents: A Survey
arXiv 2023
Affiliations
Frequent co-authors
10 from 20 papers