Research in NLP lacks geographic diversity, and the question of how NLP can be scaled to low-resourced languages has not yet been adequately solved. "Low-resourced"-ness is a complex problem going beyond data availability and reflects systemic problems in society. In this paper, we focus on the task of Machine Translation (MT), that plays a crucial role for information accessibility and communication worldwide. Despite immense improvements in MT over the past decade, MT is centered around a few high-resourced languages. As MT researchers cannot solve the problem of low-resourcedness alone, we propose participatory research as a means to involve all necessary agents required in the MT development process. We demonstrate the feasibility and scalability of participatory research with a case study on MT for African languages. Its implementation leads to a collection of novel translation datasets, MT benchmarks for over 30 languages, with human evaluations for a third of them, and enables participants without formal training to make a unique scientific contribution. Benchmarks, models, data, code, and evaluation results are released under https://github.com/masakhane-io/masakhane-mt.
Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages
Participatory research in machine translation for African languages results in new datasets, benchmarks, and human evaluations, addressing low-resourcedness issues.
- Year
- 2020
- Venue
- Findings of the Association for Computational Linguistics 2020
- Authors
- 48
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2010.02353v2ARXIV-DEFAULT
- TL;DR
- Semantic Scholar
Abstract
Authors
48Julia KreutzerHady ElsaharWilhelmina NekotoVukosi MarivateTshinondiwa MatsilaTimi FasubaaTajudeen KolawoleTaiwo FagbohungbeSolomon Oluwole AkinolaShamsuddeen Hassan MuhammadSalomon KabongoSalomey OseiSackey FreshiaRubungo Andre NiyongaboRicky MacharmPerez OgayoOrevaoghene AhiaMusie MeressaMofe AdeyemiMasabata Mokgesi-SelingaLawrence OkegbemiLaura Jane MartinusKolawole TajudeenKevin DegilaKelechi OguejiKathleen SiminyuJason WebsterJamiil Toure AliJade AbbottIroro OrifeIgnatius EzeaniIdris Abdulkabir DanganaHerman KamperGoodness DuruGhollah KiokoEspoir MurhabaziElan van BiljonDaniel WhitenackChristopher OnyefuluchiChris EmezueBonaventure DossouBlessing SibandaBlessing Itoro BasseyAyodele OlabiyiArshath RamkilowanAlp ÖktemAdewale AkinfaderinAbdallah Bashir