Lei Zhu
- Papers
- 29
Cite
Notes
Only stored in your browser.
Authored papers
29GenEvolve: Self-Evolving Image Generation Agents via Tool-Orchestrated Visual Experience Distillation
arXiv 2026
VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation
arXiv 2026
MTR-DuplexBench: Towards a Comprehensive Evaluation of Multi-Round Conversations for Full-Duplex Speech Language Models
arXiv 2026
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security
arXiv 2026
LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer
arXiv 2025
PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework
arXiv 2025
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning
arXiv 2025
UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios
arXiv 2025
An Empirical Study of GPT-4o Image Generation Capabilities
arXiv 2025
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
arXiv 2025
The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
arXiv 2025
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation
arXiv 2024
NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction
arXiv 2024
Vivim: a Video Vision Mamba for Medical Video Segmentation
arXiv 2024
RelayAttention for Efficient Large Language Model Serving with Long System Prompts
arXiv 2024
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
arXiv 2024
Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%
arXiv 2024
Beyond Text: Frozen Large Language Models in Visual Signal Comprehension
CVPR 2024 1
Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?
arXiv 2024
Revisiting the Integration of Convolution and Attention for Vision Backbone
arXiv 2024
BiFormer: Vision Transformer with Bi-Level Routing Attention
CVPR 2023 1
Masked Image Training for Generalizable Deep Image Denoising
CVPR 2023 1
Sparse Sampling Transformer with Uncertainty-Driven Ranking for Unified Removal of Raindrops and Rain Streaks
ICCV 2023 1
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
arXiv 2023
Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset
arXiv 2023
Video Adverse-Weather-Component Suppression Network via Weather Messenger and Adversarial Backpropagation
ICCV 2023 1
Towards High-Quality Specular Highlight Removal by Leveraging Large-Scale Synthetic Data
ICCV 2023 1
Involution: Inverting the Inherence of Convolution for Visual Recognition
CVPR 2021 1
Leveraging the Invariant Side of Generative Zero-Shot Learning
leveraging-the-invariant-side-of-generative-1
Affiliations
Frequent co-authors
10from 29 papers