0

Lei Zhu

Papers
29

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
29papers

Authored papers

29

GenEvolve: Self-Evolving Image Generation Agents via Tool-Orchestrated Visual Experience Distillation

arXiv 2026

2026

VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation

arXiv 2026

2026

MTR-DuplexBench: Towards a Comprehensive Evaluation of Multi-Round Conversations for Full-Duplex Speech Language Models

arXiv 2026

2026

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

arXiv 2026

2026

LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer

arXiv 2025

2025

PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

arXiv 2025

2025

Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning

arXiv 2025

2025

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

arXiv 2025

2025

An Empirical Study of GPT-4o Image Generation Capabilities

arXiv 2025

2025

V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer

arXiv 2025

2025

The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs

arXiv 2025

2025

SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

arXiv 2024

2024

NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction

arXiv 2024

2024

Vivim: a Video Vision Mamba for Medical Video Segmentation

arXiv 2024

2024

RelayAttention for Efficient Large Language Model Serving with Long System Prompts

arXiv 2024

2024

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

arXiv 2024

2024

Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%

arXiv 2024

2024

Beyond Text: Frozen Large Language Models in Visual Signal Comprehension

CVPR 2024 1

2024

Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?

arXiv 2024

2024

Revisiting the Integration of Convolution and Attention for Vision Backbone

arXiv 2024

2024

BiFormer: Vision Transformer with Bi-Level Routing Attention

CVPR 2023 1

2023

Masked Image Training for Generalizable Deep Image Denoising

CVPR 2023 1

2023

Sparse Sampling Transformer with Uncertainty-Driven Ranking for Unified Removal of Raindrops and Rain Streaks

ICCV 2023 1

2023

OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation

arXiv 2023

2023

Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset

arXiv 2023

2023

Video Adverse-Weather-Component Suppression Network via Weather Messenger and Adversarial Backpropagation

ICCV 2023 1

2023

Towards High-Quality Specular Highlight Removal by Leveraging Large-Scale Synthetic Data

ICCV 2023 1

2023

Involution: Inverting the Inherence of Convolution for Visual Recognition

CVPR 2021 1

2021

Leveraging the Invariant Side of Generative Zero-Shot Learning

leveraging-the-invariant-side-of-generative-1

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 29 papers