0

Yelong Shen

Papers
22

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
22papers

Authored papers

22

Orchard: An Open-Source Agentic Modeling Framework

arXiv 2026

2026

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

arXiv 2026

2026

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

arXiv 2025

2025

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

arXiv 2025

2025

SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning

arXiv 2025

2025

ThetaEvolve: Test-time Learning on Open Problems

arXiv 2025

2025

Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions

arXiv 2025

2025

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

arXiv 2025

2025

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

arXiv 2025

2025

OmniParser for Pure Vision Based GUI Agent

arXiv 2024

2024

Rho-1: Not All Tokens Are What You Need

arXiv 2024

2024

MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning

arXiv 2024

2024

Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models

arXiv 2024

2024

An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models

arXiv 2023

2023

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

arXiv 2023

2023

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

arXiv 2023

2023

In-Context Learning Unlocked for Diffusion Models

in-context-learning-unlocked-for-diffusion

2023

Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models

arXiv 2023

2023

GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation

arXiv 2022

2022

LoRA: Low-Rank Adaptation of Large Language Models

lora-low-rank-adaptation-of-large-language-1

2021

Adversarial Retriever-Ranker for dense text retrieval

adversarial-retriever-ranker-for-dense-text-1

2021

Generation-Augmented Retrieval for Open-domain Question Answering

ACL 2021 5

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 22 papers