We present Jamba-1.5, new instruction-tuned large language models based on our Jamba architecture. Jamba is a hybrid Transformer-Mamba mixture of experts architecture, providing high throughput and low memory usage across context lengths, while retaining the same or better quality as Transformer models. We release two model sizes: Jamba-1.5-Large, with 94B active parameters, and Jamba-1.5-Mini, with 12B active parameters. Both models are fine-tuned for a variety of conversational and instruction-following capabilities, and have an effective context length of 256K tokens, the largest amongst open-weight models. To support cost-effective inference, we introduce ExpertsInt8, a novel quantization technique that allows fitting Jamba-1.5-Large on a machine with eight 80GB GPUs when processing 256K-token contexts without loss of quality. When evaluated on a battery of academic and chatbot benchmarks, Jamba-1.5 models achieve excellent results while providing high throughput and outperforming other open-weight models on long-context benchmarks. The model weights for both sizes are publicly available under the Jamba Open Model License and we release ExpertsInt8 as open source.
Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
Jamba-1.5, a hybrid Transformer-Mamba model, offers high throughput and low memory usage while achieving high quality in instruction-following tasks, supported by ExpertsInt8 quantization.
- Year
- 2024
- Venue
- arXiv 2024
- Authors
- 61
- Hosting
- Abstract only
Attribution
- Abstract & full text
- arxiv.org/abs/2408.12570
- TL;DR
- Semantic Scholar
Authors
Tomer Asida, Yonatan Belinkov, Roi Cohen, Itay Dalmedigos, Dor Muhlgay, Yoav Shoham, Omri Abend, Gal Shachaf, Inbal Magar, Nir Ratner, Jamba Team, Barak Lenz, Alan Arazi, Amir Bergman, Avshalom Manevich, Barak Peleg, Ben Aviram, Chen Almagor, Clara Fridman, Dan Padnos, Daniel Gissin, Daniel Jannai, Dor Zimberg, Edden M Gerber, Elad Dolev, Eran Krakovsky, Erez Safahi, Erez Schwartz, Gal Cohen, Haim Rozenblum, Hofit Bata, Ido Blass, Jhonathan Osin, Julie Fadlon, Maria Rozman, Matan Danos, Michael Gokhman, Mor Zusman, Naama Gidron, Noam Gat, Noam Rozen, Oded Fried, Ohad Leshno, Omer Antverg, Opher Lieber, Or Dagan, Orit Cohavi, Raz Alon, Ro'i Belson, Rom Gilad, Roman Glozman, Shahar Lev, Shaked Meirom, Tal Delbari, Tal Ness, Tom Ben Gal, Tom Braude, Uriya Pumerantz, Yehoshua Cohen, Yuval Globerson, Yuval Peleg Levy