0

Hang Yan

Papers
29

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
29papers

Authored papers

29

OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis

arXiv 2026

2026

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

arXiv 2026

2026

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

arXiv 2026

2026

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

arXiv 2025

2025

Diffusion Language Models are Super Data Learners

arXiv 2025

2025

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

arXiv 2025

2025

$φ$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

arXiv 2025

2025

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

arXiv 2025

2025

MUR: Momentum Uncertainty guided Reasoning for Large Language Models

arXiv 2025

2025

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models

arXiv 2024

2024

Length Generalization of Causal Transformers without Position Encoding

arXiv 2024

2024

Secrets of RLHF in Large Language Models Part II: Reward Modeling

arXiv 2024

2024

Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge

arXiv 2024

2024

MouSi: Poly-Visual-Expert Vision-Language Models

arXiv 2024

2024

What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices

arXiv 2024

2024

LongWanjuan: Towards Systematic Measurement for Long Text Quality

arXiv 2024

2024

AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data

arXiv 2024

2024

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

arXiv 2024

2024

Case2Code: Learning Inductive Reasoning with Synthetic Data

arXiv 2024

2024

Balanced Data Sampling for Language Model Training with Clustering

arXiv 2024

2024

ReAttention: Training-Free Infinite Context with Finite Attention Scope

arXiv 2024

2024

F-Eval: Assessing Fundamental Abilities with Refined Evaluation Methods

arXiv 2024

2024

CoLLiE: Collaborative Training of Large Language Models in an Efficient Way

arXiv 2023

2023

AdaLomo: Low-memory Optimization with Adaptive Learning Rate

arXiv 2023

2023

WanJuan: A Comprehensive Multimodal Dataset for Advancing English and Chinese Large Models

arXiv 2023

2023

Scaling Laws of RoPE-based Extrapolation

arXiv 2023

2023

CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors

arXiv 2023

2023

Unified Demonstration Retriever for In-Context Learning

arXiv 2023

2023

CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation

arXiv 2021

2021

Affiliations

No known affiliations.

Frequent co-authors

10

from 29 papers