0

Yizhe Zhang

Research scientist at Apple MLR; previously at Meta AI and Microsoft Research; works on LLM training, agents, and dialogue.

Role
researcher
Currently at
Apple
Papers
20

Cite

Notes

Only stored in your browser.

20papers

Authored papers

20

Embarrassingly Simple Self-Distillation Improves Code Generation

arXiv 2026

2026

CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning

arXiv 2025

2025

SAGE: Steering and Refining Dialog Generation with State-Action Augmentation

arXiv 2025

2025

Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions

arXiv 2025

2025

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

arXiv 2025

2025

Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models

arXiv 2025

2025

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

arXiv 2025

2025

SWE-Gym: An Open Environment for Training Software Engineering Agents and Verifiers

preprint

2024

OpenHands: An Open Platform for AI Software Developers as Generalist Agents

arXiv 2024

2024

Scaling Diffusion Language Models via Adaptation from Autoregressive Models

arXiv 2024

2024

ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities

arXiv 2024

2024

Divide-or-Conquer? Which Part Should You Distill Your LLM?

arXiv 2024

2024

How Far Are We from Intelligent Visual Deductive Reasoning?

arXiv 2024

2024

Matryoshka Diffusion Models

arXiv 2023

2023

Stabilizing Transformer Training by Preventing Attention Entropy Collapse

arXiv 2023

2023

Probing the Multi-turn Planning Capabilities of LLMs via 20 Question Games

arXiv 2023

2023

PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model

planner-generating-diversified-paragraph-via

2023

RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling

arXiv 2021

2021

POINTER: Constrained Progressive Text Generation via Insertion-based Generative Pre-training

EMNLP 2020 11

2020

DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation

arXiv 2019

2019

Affiliations

Frequent co-authors

10

from 20 papers