0

Unconstrained Text Detection in Manga: a New Dataset and Baseline

A deep network model is developed for text binarization in Japanese manga, creating a dataset and special metrics to improve evaluation and achieve better performance than current methods.

Year
2020
Venue
arXiv 2020
Authors
2
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2009.04042ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

The detection and recognition of unconstrained text is an open problem in research. Text in comic books has unusual styles that raise many challenges for text detection. This work aims to binarize text in a comic genre with highly sophisticated text styles: Japanese manga. To overcome the lack of a manga dataset with text annotations at a pixel level, we create our own. To improve the evaluation and search of an optimal model, in addition to standard metrics in binarization, we implement other special metrics. Using these resources, we designed and evaluated a deep network model, outperforming current methods for text binarization in manga in most metrics.

Authors

2