Mitesh M. Khapra

Papers: 25

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

25papers

Authored papers

Seeing Isn't Believing: Uncovering Blind Spots in Evaluator Vision-Language Models

arXiv 2026

2026

Recognizing Every Voice: Towards Inclusive ASR for Rural Bhojpuri Women

arXiv 2025

2025

Can Vision-Language Models Evaluate Handwritten Math?

arXiv 2025

2025

IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages

arXiv 2024

2024

Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource Settings

arXiv 2024

2024

An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models

arXiv 2024

2024

How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?

arXiv 2024

2024

Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs

arXiv 2024

2024

Pralekha: An Indic Document Alignment Evaluation Benchmark

arXiv 2024

2024

LAHAJA: A Robust Multi-accent Benchmark for Evaluating Hindi ASR Systems

arXiv 2024

2024

Enhancing Out-of-Vocabulary Performance of Indian TTS Systems for Practical Applications through Low-Effort Data Strategies

arXiv 2024

2024

Finding Blind Spots in Evaluator LLMs with Interpretable Checklists

arXiv 2024

2024

Airavata: Introducing Hindi Instruction-tuned LLM

arXiv 2024

2024

BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages

arXiv 2024

2024

Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages

arXiv 2023

2023

A Comprehensive Analysis of Adapter Efficiency

arXiv 2023

2023

IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages

arXiv 2023

2023

IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages

arXiv 2022

2022

Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages

arXiv 2022

2022

Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion Users

arXiv 2022

2022

Towards Building Text-To-Speech Systems for the Next Billion Users

arXiv 2022

2022

IndicBART: A Pre-trained Model for Indic Natural Language Generation

indicbart-a-pre-trained-model-for-indic-1

2021

AI4Bharat-IndicNLP Corpus: Monolingual Corpora and Word Embeddings for Indic Languages

arXiv 2020

2020

DuoRC: Towards Complex Language Understanding with Paraphrased Reading Comprehension

duorc-towards-complex-language-understanding-1

2018

Towards Exploiting Background Knowledge for Building Conversation Systems

towards-exploiting-background-knowledge-for-1

2018

Affiliations

No known affiliations.

Frequent co-authors

from 25 papers

Anoop Kunchukuttan

15 shared papers

Raj Dabre

10 shared papers

Mohammed Safi Ur Rahman Khan

Pratyush Kumar

Ratish Puduppully

Sumanth Doddapaneni

Ashwin Sankar

Tahir Javed

Giri Raju

Janki Nawale