Sean Welleck
- Papers: 26
Authored papers
On the limits and opportunities of AI reviewers: Reviewing the reviews of Nature-family papers with 45 expert scientists
arXiv 2026
Argument Reconstruction as Supervision for Critical Thinking in LLMs
arXiv 2026
RefineBench: Evaluating Refinement Capability of Language Models via Checklists
arXiv 2025
Propose, Solve, Verify: Self-Play Through Formal Verification
arXiv 2025
Agentic-R1: Distilled Dual-Strategy Reasoning
arXiv 2025
Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators
arXiv 2025
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
arXiv 2024
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models
arXiv 2024
Evaluating Language Models as Synthetic Data Generators
arXiv 2024
miniCTX: Neural Theorem Proving with (Long-)Contexts
arXiv 2024
Faith and Fate: Limits of Transformers on Compositionality
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
arXiv 2023
STEER: Unified Style Transfer with Expert Reinforcement
arXiv 2023
Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs
arXiv 2022
A Survey of Deep Learning for Mathematical Reasoning
arXiv 2022
COLD Decoding: Energy-based Constrained Text Generation with Langevin Dynamics
arXiv 2022
Lila: A Unified Benchmark for Mathematical Reasoning
arXiv 2022
Quark: Controllable Text Generation with Reinforced Unlearning
arXiv 2022
NaturalProver: Grounded Mathematical Proof Generation with Language Models
arXiv 2022
Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering
arXiv 2022
MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models
NAACL 2022 7
NaturalProofs: Mathematical Theorem Proving in Natural Language
arXiv 2021
Generated Knowledge Prompting for Commonsense Reasoning
ACL 2022 5
NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics
NAACL 2022 7
Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts
NAACL 2022 7