In cross-lingual transfer, NLP models over one or more source languages are applied to a low-resource target language. While most prior work has used a single source model or a few carefully selected models, here we consider a `massive' setting with many such models. This setting raises the problem of poor transfer, particularly from distant languages. We propose two techniques for modulating the transfer, suitable for zero-shot or few-shot learning, respectively. Evaluating on named entity recognition, we show that our techniques are much more effective than strong baselines, including standard ensembling, and our unsupervised method rivals oracle selection of the single best individual model.
Massively Multilingual Transfer for NER
Two techniques for modulating model transfer from multiple source languages to a low-resource target language improve performance significantly in zero-shot and few-shot learning settings.
- Year
- 2019
- Venue
- massively-multilingual-transfer-for-ner
- Authors
- 3
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/1902.00193v4ARXIV-DEFAULT
- TL;DR
- Semantic Scholar