PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation

The recent trend of large language models (LLMs) is to increase the scale of both model size (\aka the number of parameters) and dataset to achieve better generative ability, which is definitely proved by a lot of work such as the famous GPT and Llama.

Open

Year: 2023
ArXiv: arxiv.org/abs/2312.17276
URL: arxiv.org/abs/2312.17276v1
Hosting: External sourcelicense unknown

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text: arxiv.org/abs/2312.17276v1
TL;DR: Semantic Scholar

Attribution policy →