Fajri Koto
Authored papers (14)
Instruction-Guided Poetry Generation in Arabic and Its Dialects
arXiv 2026
Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts
arXiv 2025
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
arXiv 2025
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
arXiv 2025
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
arXiv 2024
ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic
arXiv 2024
CMMLU: Measuring massive multitask language understanding in Chinese
arXiv 2023
Bactrian-X: Multilingual Replicable Instruction-Following Models with Low-Rank Adaptation
arXiv 2023
LLM360: Towards Fully Transparent Open-Source LLMs
arXiv 2023
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU
arXiv 2023
NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
arXiv 2023
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
arXiv 2022
IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization
EMNLP 2021
Liputan6: A Large-scale Indonesian Dataset for Text Summarization
Asian Chapter of the Association for Computational Linguistics 2020