Cite
Notes
Only stored in your browser.
Attribution
Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
arXiv 2021
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
ext5-towards-extreme-multi-task-scaling-for
Long Range Arena: A Benchmark for Efficient Transformers
arXiv 2020
from 3 papers
Donald Metzler
Yi Tay
founder
Dara Bahri
Mostafa Dehghani
Samira Abnar
Sebastian Ruder
Ashish Vaswani
Dani Yogatama
Honglei Zhuang
Huaixiu Steven Zheng