Shunian Chen
- Papers: 13
Authored papers (13)
Do Phone-Use Agents Respect Your Privacy?
arXiv 2026
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation
arXiv 2025
FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion
arXiv 2025
TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis
arXiv 2025
MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos
arXiv 2025
BlenderLLM: Training Large Language Models for Computer-Aided Design with Self-improvement
arXiv 2024
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale
arXiv 2024
Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs
arXiv 2024
ALLaVA: Harnessing GPT4V-Synthesized Data for Lite Vision-Language Models
arXiv 2024
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid Architecture
arXiv 2024
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
arXiv 2024
Humans or LLMs as the Judge? A Study on Judgement Biases
arXiv 2024
HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs
arXiv 2023
Affiliations
Frequent co-authors
10 from 13 papers