In this paper, we propose a novel top-down instance segmentation framework based on explicit shape encoding, named \textbf{ESE-Seg}. It largely reduces the computational consumption of the instance segmentation by explicitly decoding the multiple object shapes with tensor operations, thus performs the instance segmentation at almost the same speed as the object detection. ESE-Seg is based on a novel shape signature Inner-center Radius (IR), Chebyshev polynomial fitting and the strong modern object detectors. ESE-Seg with YOLOv3 outperforms the Mask R-CNN on Pascal VOC 2012 at mAP$^r$@0.5 while 7 times faster.
Explicit Shape Encoding for Real-Time Instance Segmentation
ESE-Seg, a top-down instance segmentation framework, reduces computational cost by using explicit shape encoding and tensor operations, achieving performance similar to object detection and outpacing Mask R-CNN on Pascal VOC 2012 with YOLOv3.
- Year
- 2019
- Venue
- explicit-shape-encoding-for-real-time-1
- Authors
- 4
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/1908.04067ARXIV-DEFAULT
- TL;DR
- Semantic Scholar