SciFive: a text-to-text transformer model for biomedical literature

SciFive, a domain-specific T5 model pre-trained on biomedical corpora, outperforms prior state-of-the-art methods on various biomedical NLP tasks, demonstrating the potential of text-generation methods for tasks that require longer, more complex outputs.

Year
2021
Venue
arXiv 2021
Authors
7

Abstract & full text
arxiv.org/abs/2106.03598

Abstract

In this report, we introduce SciFive, a domain-specific T5 model that has been pre-trained on large biomedical corpora. Our model outperforms the current SOTA methods (i.e. BERT, BioBERT, Base T5) on tasks in named entity recognition, relation extraction, natural language inference, and question-answering. We show that text-generation methods have significant potential in a broad array of biomedical NLP tasks, particularly those requiring longer, more complex outputs. Our results support the exploration of more difficult text generation tasks and the development of new methods in this area.
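
To make the text-to-text framing concrete, here is a minimal sketch of how a tagged sentence could be cast as an (input text, target text) pair for a generative model. The task prefix, the `label* mention *label` markup, the example sentence, and the helper name `to_text_to_text` are all illustrative assumptions; the abstract does not specify SciFive's actual input or output format.

```python
from typing import List, Tuple


def to_text_to_text(
    task_prefix: str,
    sentence: str,
    entities: List[Tuple[str, str]],
) -> Tuple[str, str]:
    """Cast a tagged sentence as an (input text, target text) pair.

    The markup used here is a hypothetical convention for illustration,
    not the format used by the SciFive authors.
    """
    # The model input is the raw sentence behind a task prefix.
    source = f"{task_prefix}: {sentence}"

    # The target repeats the sentence with each entity mention wrapped
    # in its label. Only the first occurrence of each mention is wrapped.
    target = sentence
    for mention, label in entities:
        target = target.replace(mention, f"{label}* {mention} *{label}", 1)
    return source, target


source, target = to_text_to_text(
    "ner",
    "Aspirin reduces the risk of stroke.",
    [("Aspirin", "chemical"), ("stroke", "disease")],
)
print(source)
print(target)
```

Because both sides are plain strings, the same pair format could in principle carry relation extraction, inference, or question-answering examples by changing only the prefix and the target text.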
