Jason D. Lee
- Papers
- 17
Cite
Notes
Only stored in your browser.
Authored papers
17Statistical Learning Theory in Lean 4: Empirical Processes from Scratch
arXiv 2026
What Makes a Reward Model a Good Teacher? An Optimization Perspective
arXiv 2025
Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
arXiv 2025
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
arXiv 2024
BitDelta: Your Fine-Tune May Only Be Worth One Bit
arXiv 2024
Dataset Reset Policy Optimization for RLHF
arXiv 2024
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
arXiv 2024
LoRA Training in the NTK Regime has No Spurious Local Minima
arXiv 2024
Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark
arXiv 2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
arXiv 2024
How Transformers Learn Causal Structure with Gradient Descent
arXiv 2024
Fine-Tuning Language Models with Just Forward Passes
fine-tuning-language-models-with-just-forward
REST: Retrieval-Based Speculative Decoding
arXiv 2023
Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking
arXiv 2023
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
arXiv 2023
Looped Transformers as Programmable Computers
arXiv 2023
Teaching Arithmetic to Small Transformers
arXiv 2023
Affiliations
Frequent co-authors
10from 17 papers