Mitesh M. Khapra
- Papers
- 25
Cite
Notes
Only stored in your browser.
Authored papers
25Seeing Isn't Believing: Uncovering Blind Spots in Evaluator Vision-Language Models
arXiv 2026
Recognizing Every Voice: Towards Inclusive ASR for Rural Bhojpuri Women
arXiv 2025
Can Vision-Language Models Evaluate Handwritten Math?
arXiv 2025
IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages
arXiv 2024
BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages
arXiv 2024
How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?
arXiv 2024
Pralekha: An Indic Document Alignment Evaluation Benchmark
arXiv 2024
LAHAJA: A Robust Multi-accent Benchmark for Evaluating Hindi ASR Systems
arXiv 2024
Enhancing Out-of-Vocabulary Performance of Indian TTS Systems for Practical Applications through Low-Effort Data Strategies
arXiv 2024
Finding Blind Spots in Evaluator LLMs with Interpretable Checklists
arXiv 2024
Airavata: Introducing Hindi Instruction-tuned LLM
arXiv 2024
Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource Settings
arXiv 2024
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models
arXiv 2024
Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs
arXiv 2024
Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages
arXiv 2023
IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages
arXiv 2023
A Comprehensive Analysis of Adapter Efficiency
arXiv 2023
Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion Users
arXiv 2022
Towards Building Text-To-Speech Systems for the Next Billion Users
arXiv 2022
Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages
arXiv 2022
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages
arXiv 2022
IndicBART: A Pre-trained Model for Indic Natural Language Generation
indicbart-a-pre-trained-model-for-indic-1
AI4Bharat-IndicNLP Corpus: Monolingual Corpora and Word Embeddings for Indic Languages
arXiv 2020
Towards Exploiting Background Knowledge for Building Conversation Systems
towards-exploiting-background-knowledge-for-1
DuoRC: Towards Complex Language Understanding with Paraphrased Reading Comprehension
duorc-towards-complex-language-understanding-1
Affiliations
Frequent co-authors
10from 25 papers
Anoop Kunchukuttan
Raj Dabre
Mohammed Safi Ur Rahman Khan
Pratyush Kumar
Ratish Puduppully
Sumanth Doddapaneni
Ashwin Sankar
Tahir Javed
Giri Raju
Janki Nawale