How well can NLP models generalize to a variety of unseen tasks when provided with task instructions? To address this question, we first introduce Super-NaturalInstructions, a benchmark of 1,616 diverse NLP tasks and their expert-written instructions. Our collection covers 76 distinct task types, including but not limited to classification, extraction, infilling, sequence tagging, text rewriting, and text composition. This large and diverse collection of tasks enables rigorous benchmarking of cross-task generalization under instructions -- training models to follow instructions on a subset of tasks and evaluating them on the remaining unseen ones. Furthermore, we build Tk-Instruct, a transformer model trained to follow a variety of in-context instructions (plain language task definitions or k-shot examples). Our experiments show that Tk-Instruct outperforms existing instruction-following models such as InstructGPT by over 9% on our benchmark despite being an order of magnitude smaller. We further analyze generalization as a function of various scaling parameters, such as the number of observed tasks, the number of instances per task, and model sizes. We hope our dataset and model facilitate future progress towards more general-purpose NLP models.
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Tk-Instruct, a smaller transformer model, achieves superior performance on a diverse range of NLP tasks defined by instructions compared to larger models, highlighting the importance of varied training data in generalization.
- Year
- 2022
- Venue
- arXiv 2022
- Authors
- 40
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2204.07705v3ARXIV-DEFAULT
- TL;DR
- Semantic Scholar
Abstract
Authors
40Hannaneh HajishirziYejin ChoiYizhong WangNoah A. SmithSwaroop MishraChitta BaralDaniel KhashabiPegah AlipoormolabashiXudong ShenSiddhartha MishraYeganeh KordiAmirreza MirzaeiAnjana ArunkumarArjun AshokArut Selvan DhanasekaranAtharva NaikDavid StapEshaan PathakGiannis KaramanolakisHaizhi Gary LaiIshan PurohitIshani MondalJacob AndersonKirby KuzniaKrima DoshiMaitreya PatelKuntal Kumar PalMehrad MoradshahiMihir ParmarMirali PurohitNeeraj VarshneyPhani Rohitha KazaPulkit VermaRavsehaj Singh PuriRushang KariaShailaja Keyur SampatSavan DoshiSujan ReddySumanta PatroTanay Dixit