Memory Mosaics

Memory Mosaics, which are networks of associative memories, achieve prediction tasks with compositional and in-context learning capabilities more transparently than transformers and perform well in language modeling tasks.

Open

Preview
Year: 2024
Venue: arXiv 2024
ArXiv: arxiv.org/abs/2405.06394
Authors: 5
Hosting: Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text: arxiv.org/abs/2405.06394v3ARXIV-DEFAULT
TL;DR: Semantic Scholar

Attribution policy →

Abstract

Memory Mosaics are networks of associative memories working in concert to achieve a prediction task of interest. Like transformers, memory mosaics possess compositional capabilities and in-context learning capabilities. Unlike transformers, memory mosaics achieve these capabilities in comparatively transparent way ("predictive disentanglement"). We illustrate these capabilities on a toy example and also show that memory mosaics perform as well or better than transformers on medium-scale language modeling tasks.

Authors

Beidi Chen Jianyu Zhang Niklas Nolte Ranajoy Sadhukhan Léon Bottou