Zejiang Shen
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
arXiv 2024
The Semantic Scholar Open Data Platform
arXiv 2023
Multi-LexSum: Real-World Summaries of Civil Rights Lawsuits at Multiple Granularities
arXiv 2022
Don't Say What You Don't Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search
arXiv 2022
LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis
arXiv 2021
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers