0

Mitesh M. Khapra

Papers
25

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
25papers

Authored papers

25

Seeing Isn't Believing: Uncovering Blind Spots in Evaluator Vision-Language Models

arXiv 2026

2026

Recognizing Every Voice: Towards Inclusive ASR for Rural Bhojpuri Women

arXiv 2025

2025

Can Vision-Language Models Evaluate Handwritten Math?

arXiv 2025

2025

IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages

arXiv 2024

2024

BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages

arXiv 2024

2024

How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?

arXiv 2024

2024

Pralekha: An Indic Document Alignment Evaluation Benchmark

arXiv 2024

2024

LAHAJA: A Robust Multi-accent Benchmark for Evaluating Hindi ASR Systems

arXiv 2024

2024

Enhancing Out-of-Vocabulary Performance of Indian TTS Systems for Practical Applications through Low-Effort Data Strategies

arXiv 2024

2024

Finding Blind Spots in Evaluator LLMs with Interpretable Checklists

arXiv 2024

2024

Airavata: Introducing Hindi Instruction-tuned LLM

arXiv 2024

2024

Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource Settings

arXiv 2024

2024

An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models

arXiv 2024

2024

Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs

arXiv 2024

2024

Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages

arXiv 2023

2023

IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages

arXiv 2023

2023

A Comprehensive Analysis of Adapter Efficiency

arXiv 2023

2023

Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion Users

arXiv 2022

2022

Towards Building Text-To-Speech Systems for the Next Billion Users

arXiv 2022

2022

Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages

arXiv 2022

2022

IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages

arXiv 2022

2022

IndicBART: A Pre-trained Model for Indic Natural Language Generation

indicbart-a-pre-trained-model-for-indic-1

2021

AI4Bharat-IndicNLP Corpus: Monolingual Corpora and Word Embeddings for Indic Languages

arXiv 2020

2020

Towards Exploiting Background Knowledge for Building Conversation Systems

towards-exploiting-background-knowledge-for-1

2018

DuoRC: Towards Complex Language Understanding with Paraphrased Reading Comprehension

duorc-towards-complex-language-understanding-1

2018

Affiliations

No known affiliations.

Frequent co-authors

10

from 25 papers