0

Ashish Sabharwal

Papers
26

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
26papers

Authored papers

26

Olmo 3

arXiv 2025

2025

DiscoveryBench: Towards Data-Driven Discovery with Large Language Models

arXiv 2024

2024

AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents

arXiv 2024

2024

SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories

arXiv 2024

2024

Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy

arXiv 2023

2023

Specializing Smaller Language Models towards Multi-Step Reasoning

arXiv 2023

2023

Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs

arXiv 2023

2023

Closing the Curious Case of Neural Text Degeneration

arXiv 2023

2023

Leveraging Code to Improve In-context Learning for Semantic Parsing

arXiv 2023

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

TMLR

2022

Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions

arXiv 2022

2022

Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs

arXiv 2022

2022

Decomposed Prompting: A Modular Approach for Solving Complex Tasks

arXiv 2022

2022

Lila: A Unified Benchmark for Mathematical Reasoning

arXiv 2022

2022

DISCO: Distilling Counterfactuals with Large Language Models

arXiv 2022

2022

Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts

arXiv 2022

2022

What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment

arXiv 2022

2022

MuSiQue: Multihop Questions via Single-hop Question Composition

arXiv 2021

2021

GooAQ: Open Question Answering with Diverse Answer Types

Findings (EMNLP) 2021 11

2021

Hey AI, Can You Solve Complex Tasks by Talking to Agents?

Findings (ACL) 2022 5

2021

Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts

NAACL 2022 7

2021

UnifiedQA: Crossing Format Boundaries With a Single QA System

Findings of the Association for Computational Linguistics 2020

2020

Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models

NAACL 2021 4

2020

QASC: A Dataset for Question Answering via Sentence Composition

arXiv 2019

2019

What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge

arXiv 2019

2019

Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering

can-a-suit-of-armor-conduct-electricity-a-new-1

2018

Affiliations

No known affiliations.

Frequent co-authors

10

from 26 papers