Li Lucy
3 papers
Authored papers
AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters
arXiv 2024
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
arXiv 2024
Are distributional representations ready for the real world? Evaluating word vectors for grounded perceptual meaning
Affiliations
No known affiliations.
Frequent co-authors
10 from 3 papers