Ashish Sabharwal
- Papers: 26
Authored papers (26)
Olmo 3
arXiv 2025
DiscoveryBench: Towards Data-Driven Discovery with Large Language Models
arXiv 2024
AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
arXiv 2024
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories
arXiv 2024
Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy
arXiv 2023
Specializing Smaller Language Models towards Multi-Step Reasoning
arXiv 2023
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
arXiv 2023
Closing the Curious Case of Neural Text Degeneration
arXiv 2023
Leveraging Code to Improve In-context Learning for Semantic Parsing
arXiv 2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
TMLR
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions
arXiv 2022
Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
arXiv 2022
Decomposed Prompting: A Modular Approach for Solving Complex Tasks
arXiv 2022
Lila: A Unified Benchmark for Mathematical Reasoning
arXiv 2022
DISCO: Distilling Counterfactuals with Large Language Models
arXiv 2022
Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts
arXiv 2022
What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment
arXiv 2022
MuSiQue: Multihop Questions via Single-hop Question Composition
arXiv 2021
GooAQ: Open Question Answering with Diverse Answer Types
Findings (EMNLP) 2021 11
Hey AI, Can You Solve Complex Tasks by Talking to Agents?
Findings (ACL) 2022 5
Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts
NAACL 2022 7
UnifiedQA: Crossing Format Boundaries With a Single QA System
Findings of the Association for Computational Linguistics 2020
Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models
NAACL 2021 4
QASC: A Dataset for Question Answering via Sentence Composition
arXiv 2019
What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge
arXiv 2019
Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering
Frequent co-authors
10 from 26 papers
Tushar Khot
Peter Clark
Kyle Richardson
Daniel Khashabi
Harsh Trivedi
Hannaneh Hajishirzi (professor)
Matthew Finlayson
Niranjan Balasubramanian
Shashank Gupta
Oyvind Tafjord