0

Protoformer: Embedding Prototypes for Transformers

Protoformer is a self-learning framework for Transformers that enhances text classification by efficiently using anomaly and difficult class prototypes.

Year
2022
Venue
protoformer-embedding-prototypes-for
Authors
6
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2206.12710ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

Transformers have been widely applied in text classification. Unfortunately, real-world data contain anomalies and noisy labels that cause challenges for state-of-art Transformers. This paper proposes Protoformer, a novel self-learning framework for Transformers that can leverage problematic samples for text classification. Protoformer features a selection mechanism for embedding samples that allows us to efficiently extract and utilize anomalies prototypes and difficult class prototypes. We demonstrated such capabilities on datasets with diverse textual structures (e.g., Twitter, IMDB, ArXiv). We also applied the framework to several models. The results indicate that Protoformer can improve current Transformers in various empirical settings.

Authors

6