Anoop Kunchukuttan
- Papers
- 17
Cite
Notes
Only stored in your browser.
Authored papers
17IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages
arXiv 2024
BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages
arXiv 2024
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models
arXiv 2024
How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?
arXiv 2024
Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs
arXiv 2024
Pralekha: An Indic Document Alignment Evaluation Benchmark
arXiv 2024
Airavata: Introducing Hindi Instruction-tuned LLM
arXiv 2024
Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages
arXiv 2023
A Comprehensive Analysis of Adapter Efficiency
arXiv 2023
IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages
arXiv 2023
Evaluating Inter-Bilingual Semantic Parsing for Indian Languages
arXiv 2023
IndicXNLI: Evaluating Multilingual Inference for Indian Languages
arXiv 2022
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages
arXiv 2022
Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages
arXiv 2022
Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion Users
arXiv 2022
IndicBART: A Pre-trained Model for Indic Natural Language Generation
indicbart-a-pre-trained-model-for-indic-1
AI4Bharat-IndicNLP Corpus: Monolingual Corpora and Word Embeddings for Indic Languages
arXiv 2020
Affiliations
Frequent co-authors
10from 17 papers
Mitesh M. Khapra
Raj Dabre
Pratyush Kumar
Ratish Puduppully
Mohammed Safi Ur Rahman Khan
Sumanth Doddapaneni
Divyanshu Aggarwal
Jay Gala
Nandini Mundra
Sparsh Jain