Zhe Li
- Papers
- 19
Cite
Notes
Only stored in your browser.
Authored papers
19RoboBrain 2.5: Depth in Sight, Time in Mind
arXiv 2026
Significance and Stability Analysis of Gene-Environment Interaction using RGxEStat
arXiv 2026
Composed Multi-modal Retrieval: A Survey of Approaches and Applications
arXiv 2025
BodyGen: Advancing Towards Efficient Embodiment Co-Design
bodygen-advancing-towards-efficient
Label-free Motion-Conditioned Diffusion Model for Cardiac Ultrasound Synthesis
arXiv 2025
Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
arXiv 2025
Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning
arXiv 2025
Spectral-Aware Low-Rank Adaptation for Speaker Verification
arXiv 2025
MVAD : A Comprehensive Multimodal Video-Audio Dataset for AIGC Detection
arXiv 2025
ViRC: Enhancing Visual Interleaved Mathematical CoT with Reason Chunking
arXiv 2025
Q-MLLM: Vector Quantization for Robust Multimodal Large Language Model Security
arXiv 2025
MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos
arXiv 2024
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
arXiv 2024
Are AI-Generated Text Detectors Robust to Adversarial Perturbations?
arXiv 2024
Track Anything: Segment Anything Meets Videos
arXiv 2023
Animatable and Relightable Gaussians for High-fidelity Human Avatar Modeling
arXiv 2023
HHAvatar: Gaussian Head Avatar with Dynamic Hairs
CVPR 2024 1
Caption Anything: Interactive Image Description with Diverse Multimodal Controls
arXiv 2023
From Knowledge Distillation to Self-Knowledge Distillation: A Unified Approach with Normalized Loss and Customized Soft Labels
ICCV 2023 1
Affiliations
Frequent co-authors
10from 19 papers