Tanya Goyal
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13Challenges in Trustworthy Human Evaluation of Chatbots
arXiv 2024
LitSearch: A Retrieval Benchmark for Scientific Literature Search
arXiv 2024
FABLES: Evaluating faithfulness and content selection in book-length summarization
arXiv 2024
One Thousand and One Pairs: A "novel" challenge for long-context language models
arXiv 2024
D2PO: Discriminator-Guided DPO with Response Evaluation Models
arXiv 2024
Recycled Attention: Efficient inference for long-context language models
arXiv 2024
Evaluating Large Language Models at Evaluating Instruction Following
arXiv 2023
BooookScore: A systematic exploration of book-length summarization in the era of LLMs
arXiv 2023
WiCE: Real-World Entailment for Claims in Wikipedia
arXiv 2023
A Long Way to Go: Investigating Length Correlations in RLHF
arXiv 2023
News Summarization and Evaluation in the Era of GPT-3
arXiv 2022
Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors
arXiv 2022
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
arXiv 2021
Affiliations
Frequent co-authors
10from 13 papers