Pradeep Dasigi
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13Meta-Reinforcement Learning with Self-Reflection for Agentic Search
arXiv 2026
Olmo 3
arXiv 2025
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
arXiv 2025
Large-Scale Data Selection for Instruction Tuning
arXiv 2025
2 OLMo 2 Furious
arXiv 2024
Tulu 3: Pushing Frontiers in Open Language Model Post-Training
preprint
OLMo: Accelerating the Science of Language Models
arXiv 2024
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback
arXiv 2024
HREF: Human Response-Guided Evaluation of Instruction Following in Language Models
arXiv 2024
Scalable Data Ablation Approximations for Language Models through Modular Training and Merging
arXiv 2024
TRAM: Bridging Trust Regions and Sharpness Aware Minimization
arXiv 2023
Generating Data to Mitigate Spurious Correlations in Natural Language Inference Datasets
arXiv 2022
A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers
NAACL 2021 4
Affiliations
Frequent co-authors
10from 13 papers
Hannaneh Hajishirzi
professor
Faeze Brahman
researcher
Hamish Ivison
grad-student
Noah A. Smith
Luca Soldaini
Nathan Lambert
researcher
Valentina Pyatkin
research-scientist
Jacob Morrison
research-engineer
Kyle Lo
Lester James V. Miranda