Fahad Khan
- Papers: 12
Authored papers (12)
MediX-R1: Open Ended Medical Reinforcement Learning
arXiv 2026
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
arXiv 2025
KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding
arXiv 2025
VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos
arXiv 2025
EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards
arXiv 2025
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
arXiv 2024
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
CVPR 2025
BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities
arXiv 2024
Sentence-level Prompts Benefit Composed Image Retrieval
arXiv 2023
PG-Video-LLaVA: Pixel Grounding Large Video-Language Models
arXiv 2023
Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
arXiv 2023
CLIP model is an Efficient Continual Learner
arXiv 2022
Frequent co-authors
10 from 12 papers
Salman Khan
Hisham Cholakkal
Mohammed Irfan Kurpath
Muhammad Maaz
Rao Muhammad Anwer
Sahal Shaji Mullappilly
Abdelrahman Shaker
Hanoona Rasheed
Mubarak Shah
Rao Anwer