Nenghai Yu
- Papers: 13
Authored papers
HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising
arXiv 2026
A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations
arXiv 2025
ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing
arXiv 2025
De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks
arXiv 2025
M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment
arXiv 2025
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
CVPR 2024
Diversity-Aware Meta Visual Prompting
CVPR 2023
Watermarking Text Generated by Black-Box Language Models
arXiv 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting
ICCV 2023
Reduce Information Loss in Transformers for Pluralistic Image Inpainting
CVPR 2022
CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet
arXiv 2022
HairCLIP: Design Your Hair by Text and Reference Image
CVPR 2022
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows
Affiliations
Frequent co-authors
10 from 13 papers