M. Jehanzeb Mirza

Papers: 12

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

TTRV: Test-Time Reinforcement Learning for Vision Language Models

arXiv 2025

2025

KV Cache Steering for Inducing Reasoning in Small Language Models

arXiv 2025

2025

VisualOverload: Probing Visual Understanding of VLMs in Really Dense Scenes

arXiv 2025

2025

Overflow Prevention Enhances Long-Context Recurrent LLMs

arXiv 2025

2025

PRISMM-Bench: A Benchmark of Peer-Review Grounded Multimodal Inconsistencies

arXiv 2025

2025

LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content

arXiv 2024

2024

Teaching VLMs to Localize Specific Objects from In-context Examples

ICCV 2025

2024

Can We Talk Models Into Seeing the World Differently?

arXiv 2024

2024

Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs

arXiv 2024

2024

GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models

arXiv 2024

2024

ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs

arXiv 2024

2024

MATE: Masked Autoencoders are Online 3D Test-Time Learners

ICCV 2023 1

2022

Affiliations

No known affiliations.

Frequent co-authors

from 12 papers

Wei Lin

Sivan Doveh

James Glass

Leonid Karlinsky

Hilde Kuehne

Rogerio Feris

Paul Gavrikov

Assaf Arbelle

Horst Possegger

Raja Giryes