Junlin Han
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting
arXiv 2026
Small Vision-Language Models are Smart Compressors for Long Video Understanding
arXiv 2026
EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing
arXiv 2025
Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model
CVPR 2025 1
Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning
arXiv 2025
From Pixels to Feelings: Aligning MLLMs with Human Cognitive Perception of Images
arXiv 2025
Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
arXiv 2024
3D-GPT: Procedural 3D Modeling with Large Language Models
arXiv 2023
How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs
arXiv 2023
Affiliations
Frequent co-authors
10from 9 papers