Omkar Thawakar
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device
arXiv 2026
LLM Post-Training: A Deep Dive into Reasoning Large Language Models
arXiv 2025
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding
arXiv 2025
Time Travel: A Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts
arXiv 2025
ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark
arXiv 2025
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
arXiv 2025
Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs
arXiv 2025
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
CVPR 2025 1
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT
arXiv 2024
CAMEL-Bench: A Comprehensive Arabic LMM Benchmark
arXiv 2024
XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models
arXiv 2023
Affiliations
Frequent co-authors
10from 11 papers