Ashish Sabharwal

Papers: 26

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

26

Olmo 3

arXiv 2025

DiscoveryBench: Towards Data-Driven Discovery with Large Language Models

arXiv 2024

AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents

arXiv 2024

SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories

arXiv 2024

Specializing Smaller Language Models towards Multi-Step Reasoning

arXiv 2023

Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs

arXiv 2023

Closing the Curious Case of Neural Text Degeneration

arXiv 2023

Leveraging Code to Improve In-context Learning for Semantic Parsing

arXiv 2023

Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy

arXiv 2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

TMLR

Decomposed Prompting: A Modular Approach for Solving Complex Tasks

arXiv 2022

Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions

arXiv 2022

Lila: A Unified Benchmark for Mathematical Reasoning

arXiv 2022

DISCO: Distilling Counterfactuals with Large Language Models

arXiv 2022

Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts

arXiv 2022

Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs

arXiv 2022

What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment

arXiv 2022

MuSiQue: Multihop Questions via Single-hop Question Composition

arXiv 2021

GooAQ: Open Question Answering with Diverse Answer Types

Findings (EMNLP) 2021 11

Hey AI, Can You Solve Complex Tasks by Talking to Agents?

Findings (ACL) 2022 5

Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts

NAACL 2022 7

UnifiedQA: Crossing Format Boundaries With a Single QA System

Findings of the Association for Computational Linguistics 2020

Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models

NAACL 2021 4

QASC: A Dataset for Question Answering via Sentence Composition

arXiv 2019

What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge

arXiv 2019

Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering

Affiliations

No known affiliations.

Frequent co-authors

10

from 26 papers

Tushar Khot

17 shared papers

Peter Clark

12 shared papers

Kyle Richardson

10 shared papers

Daniel Khashabi

6 shared papers

Harsh Trivedi

6 shared papers

Hannaneh Hajishirzi

professor

5 shared papers

Matthew Finlayson

5 shared papers

Niranjan Balasubramanian

4 shared papers

Shashank Gupta

4 shared papers

Oyvind Tafjord

3 shared papers