The detection and recognition of unconstrained text is an open problem in research. Text in comic books has unusual styles that raise many challenges for text detection. This work aims to binarize text in a comic genre with highly sophisticated text styles: Japanese manga. To overcome the lack of a manga dataset with text annotations at a pixel level, we create our own. To improve the evaluation and search of an optimal model, in addition to standard metrics in binarization, we implement other special metrics. Using these resources, we designed and evaluated a deep network model, outperforming current methods for text binarization in manga in most metrics.
Unconstrained Text Detection in Manga: a New Dataset and Baseline
A deep network model is developed for text binarization in Japanese manga, creating a dataset and special metrics to improve evaluation and achieve better performance than current methods.
- Year
- 2020
- Venue
- arXiv 2020
- Authors
- 2
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2009.04042ARXIV-DEFAULT
- TL;DR
- Semantic Scholar