Jean Lahoud

Papers: 7

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

7papers

Authored papers

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

arXiv 2025

2025

Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks

arXiv 2025

2025

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

arXiv 2025

2025

DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding

arXiv 2025

2025

A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos

arXiv 2025

2025

Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation

arXiv 2024

2024

PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model

arXiv 2024

2024

Affiliations

No known affiliations.

Frequent co-authors

from 7 papers

Hisham Cholakkal

Rao Muhammad Anwer

Salman Khan

Fahad Shahbaz Khan

Noor Ahsan

Yuhao Li

Dinura Dissanayake

Ivan Laptev

Ketan More

Mohammed Irfan Kurpath

2 shared papers