M. Jehanzeb Mirza
- Papers: 12
Authored papers
KV Cache Steering for Inducing Reasoning in Small Language Models
arXiv 2025
Overflow Prevention Enhances Long-Context Recurrent LLMs
arXiv 2025
PRISMM-Bench: A Benchmark of Peer-Review Grounded Multimodal Inconsistencies
arXiv 2025
TTRV: Test-Time Reinforcement Learning for Vision Language Models
arXiv 2025
VisualOverload: Probing Visual Understanding of VLMs in Really Dense Scenes
arXiv 2025
Can We Talk Models Into Seeing the World Differently?
arXiv 2024
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models
arXiv 2024
ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs
arXiv 2024
Teaching VLMs to Localize Specific Objects from In-context Examples
ICCV 2025
Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs
arXiv 2024
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content
arXiv 2024
MATE: Masked Autoencoders are Online 3D Test-Time Learners
ICCV 2023
Frequent co-authors
10 from 12 papers