Cite
Notes
Only stored in your browser.
Attribution
RedPajama: an Open Dataset for Training Large Language Models
arXiv 2024
Proving Test Set Contamination in Black Box Language Models
arXiv 2023
from 2 papers
Anton Alexandrov
Ben Athiwaratkun
Ce Zhang
Christopher Ré
Daniel Fu
Faisal Ladhak
Huu Nguyen
Irina Rish
Kezhen Chen
Maurice Weber