Jean Lahoud
7 papers
Authored papers
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
arXiv 2025
A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos
arXiv 2025
Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks
arXiv 2025
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding
arXiv 2025
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
arXiv 2025
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
arXiv 2024
PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 7 papers