0

Docling Technical Report

This technical report introduces Docling, an easy to use, self-contained, MIT-licensed open-source package for PDF document conversion.

Year
2024
Venue
arXiv 2024
Authors
19
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2408.09869v5ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

This technical report introduces Docling, an easy to use, self-contained, MIT-licensed open-source package for PDF document conversion. It is powered by state-of-the-art specialized AI models for layout analysis (DocLayNet) and table structure recognition (TableFormer), and runs efficiently on commodity hardware in a small resource budget. The code interface allows for easy extensibility and addition of new features and models.

Authors

19