Cite
Notes
Only stored in your browser.
Attribution
olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models
arXiv 2025
The Semantic Scholar Open Data Platform
arXiv 2023
from 2 papers
Jason Dunkelberger
Kyle Lo
Luca Soldaini
Regan Huff
Alex D. Wade
Alexandra Buraczynski
Aman Rangapur
Amanpreet Singh
Amber Tanaka
Angele Zamarron