Quentin Anthony
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs
arXiv 2025
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
arXiv 2024
Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference
arXiv 2024
RedPajama: an Open Dataset for Training Large Language Models
arXiv 2024
BlackMamba: Mixture of Experts for State-Space Models
arXiv 2024
Zyda: A 1.3T Dataset for Open Language Modeling
arXiv 2024
RWKV: Reinventing RNNs for the Transformer Era
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers