Sheng Jin

Papers: 12

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

12papers

Authored papers

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

arXiv 2025

2025

Harmonizing Visual Representations for Unified Multimodal Understanding and Generation

ICCV 2025

2025

Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures

arXiv 2025

2025

COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes

arXiv 2025

2025

FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation

arXiv 2024

2024

AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks

arXiv 2024

2024

F-LMM: Grounding Frozen Large Multimodal Models

CVPR 2025 1

2024

Vision-Language Models for Vision Tasks: A Survey

arXiv 2023

2023

CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

arXiv 2023

2023

Uncertainty-aware Unsupervised Multi-Object Tracking

ICCV 2023 1

2023

CLIM: Contrastive Language-Image Mosaic for Region Representation

arXiv 2023

2023

You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 12 papers

Wentao Liu

Chen Change Loy

Lumin Xu

Size Wu

Wenwei Zhang

Chen Qian

Chenghua Lin

Jiazhan Feng

researcher

2 shared papers

Kai Liu

2 shared papers

Libo Qin

2 shared papers