Sharan Narang
Authored papers
Law of the Weakest Link: Cross Capabilities of Large Language Models
arXiv 2024
Llama 2: Open Foundation and Fine-Tuned Chat Models
arXiv 2023
UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
arXiv 2023
Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$
arXiv 2022
Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
arXiv 2021
Do Transformer Modifications Transfer Across Implementations and Applications?
EMNLP 2021
ByT5: Towards a token-free future with pre-trained byte-to-byte models
arXiv 2021
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
arXiv 2015
Affiliations
Frequent co-authors
10 from 8 papers
Adam Roberts
Hyung Won Chung (researcher)
Colin Raffel
Yi Tay (founder)
Dani Yogatama
Melanie Kambadur
Noah Constant
Noah Fiedel
Noam Shazeer (VP / co-lead Gemini)
Rui Hou