0

Juncheng Li

Papers
21

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
21papers

Authored papers

21

InstructSAM: Segment Any Instance with Any Instructions

arXiv 2026

2026

OpenRT: An Open-Source Red Teaming Framework for Multimodal LLMs

arXiv 2026

2026

Evolve the Method, Not the Prompts: Evolutionary Synthesis of Jailbreak Attacks on LLMs

arXiv 2025

2025

BackdoorVLM: A Benchmark for Backdoor Attacks on Vision-Language Models

arXiv 2025

2025

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

arXiv 2025

2025

Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder

arXiv 2025

2025

Expanding the Action Space of LLMs to Reason Beyond Language

arXiv 2025

2025

HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization

arXiv 2025

2025

On Path to Multimodal Generalist: General-Level and General-Bench

arXiv 2025

2025

WiseEdit: Benchmarking Cognition- and Creativity-Informed Image Editing

arXiv 2025

2025

MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models

arXiv 2025

2025

The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation

ICCV 2025

2025

Align$^2$LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation

arXiv 2024

2024

Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning

arXiv 2024

2024

HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing

arXiv 2024

2024

HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models

arXiv 2024

2024

Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization

arXiv 2023

2023

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions

arXiv 2023

2023

HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data

CVPR 2024 1

2023

Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World

ICCV 2023 1

2023

Masked Autoencoders that Listen

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 21 papers