Shunian Chen

Papers: 13

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Do Phone-Use Agents Respect Your Privacy?

arXiv 2026

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

arXiv 2025

FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion

arXiv 2025

TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis

arXiv 2025

MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos

arXiv 2025

BlenderLLM: Training Large Language Models for Computer-Aided Design with Self-improvement

arXiv 2024

HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale

arXiv 2024

Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs

arXiv 2024

ALLaVA: Harnessing GPT4V-Synthesized Data for Lite Vision-Language Models

arXiv 2024

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid Architecture

arXiv 2024

Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination

arXiv 2024

Humans or LLMs as the Judge? A Study on Judgement Biases

arXiv 2024

HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs

arXiv 2023

Affiliations

No known affiliations.

Frequent co-authors

from 13 papers

Benyou Wang

Junying Chen

Xidong Wang

Dingjie Song

Ke Ji

Guiming Hardy Chen

Xiang Wan

Zhenyang Cai

Anningzhe Gao

Feng Jiang