Tushar Khot

Papers
22

Attribution

Affiliations & profile
Semantic Scholar

Authored papers

22

OLMo: Accelerating the Science of Language Models

arXiv 2024

2024

DiscoveryBench: Towards Data-Driven Discovery with Large Language Models

arXiv 2024

2024

Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning

arXiv 2024

2024

AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents

arXiv 2024

2024

SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories

arXiv 2024

2024

Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models' Reasoning Performance

arXiv 2023

2023

Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback

arXiv 2023

2023

Specializing Smaller Language Models towards Multi-Step Reasoning

arXiv 2023

2023

Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs

arXiv 2023

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

TMLR

2022

Decomposed Prompting: A Modular Approach for Solving Complex Tasks

arXiv 2022

2022

Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions

arXiv 2022

2022

Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts

arXiv 2022

2022

MuSiQue: Multihop Questions via Single-hop Question Composition

arXiv 2021

2021

GooAQ: Open Question Answering with Diverse Answer Types

Findings (EMNLP) 2021 11

2021

Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts

NAACL 2022 7

2021

Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies

arXiv 2021

2021

Hey AI, Can You Solve Complex Tasks by Talking to Agents?

Findings (ACL) 2022 5

2021

Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models

NAACL 2021 4

2020

UnifiedQA: Crossing Format Boundaries With a Single QA System

Findings (EMNLP) 2020

2020

QASC: A Dataset for Question Answering via Sentence Composition

arXiv 2019

2019

Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering

EMNLP 2018

2018

Affiliations

No known affiliations.

Frequent co-authors

10

from 22 papers