Pradeep Dasigi

Papers: 13

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

13papers

Authored papers

Meta-Reinforcement Learning with Self-Reflection for Agentic Search

arXiv 2026

2026

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

arXiv 2025

2025

Large-Scale Data Selection for Instruction Tuning

arXiv 2025

2025

Olmo 3

arXiv 2025

2025

2 OLMo 2 Furious

arXiv 2024

2024

Tulu 3: Pushing Frontiers in Open Language Model Post-Training

preprint

2024

OLMo: Accelerating the Science of Language Models

arXiv 2024

2024

Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback

arXiv 2024

2024

HREF: Human Response-Guided Evaluation of Instruction Following in Language Models

arXiv 2024

2024

Scalable Data Ablation Approximations for Language Models through Modular Training and Merging

arXiv 2024

2024

TRAM: Bridging Trust Regions and Sharpness Aware Minimization

arXiv 2023

2023

Generating Data to Mitigate Spurious Correlations in Natural Language Inference Datasets

arXiv 2022

2022

A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers

NAACL 2021 4

2021

Affiliations

No known affiliations.

Frequent co-authors

from 13 papers

Hannaneh Hajishirzi

professor

8 shared papers

Faeze Brahman

researcher

7 shared papers

Hamish Ivison

grad-student

7 shared papers

Noah A. Smith

7 shared papers

Luca Soldaini

5 shared papers

Nathan Lambert

researcher

5 shared papers

Valentina Pyatkin

research-scientist

5 shared papers

Jacob Morrison

research-engineer

4 shared papers

Kyle Lo

4 shared papers

Lester James V. Miranda

4 shared papers