Junlin Han

Papers: 9

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

9papers

Authored papers

Small Vision-Language Models are Smart Compressors for Long Video Understanding

arXiv 2026

2026

AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting

arXiv 2026

2025

Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model

CVPR 2025 1

2025

Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning

arXiv 2025

2025

From Pixels to Feelings: Aligning MLLMs with Human Cognitive Perception of Images

arXiv 2025

2025

EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing

arXiv 2025

2025

Semantic Score Distillation Sampling for Compositional Text-to-3D Generation

arXiv 2024

2024

How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs

arXiv 2023

2023

3D-GPT: Procedural 3D Modeling with Large Language Models

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Philip Torr

Runjia Li

Tianyi Bai

Wentao Zhang

Aliaksandr Siarohin

Arpit Sahni

Ashkan Mirzaei

Bingchen Zhao

Binhang Yuan

Bohan Zeng