Kehan Li
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14RynnBrain: Open Embodied Foundation Models
arXiv 2026
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding
arXiv 2025
RynnVLA-002: A Unified Vision-Language-Action and World Model
arXiv 2025
RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation
arXiv 2025
RynnEC: Bringing MLLMs into Embodied World
arXiv 2025
Advances in 4D Generation: A Survey
arXiv 2025
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss
arXiv 2024
SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language Description
arXiv 2024
Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation
arXiv 2024
GraCo: Granularity-Controllable Interactive Segmentation
CVPR 2024 1
Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning
arXiv 2024
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
ICCV 2023 1
FreestyleRet: Retrieving Images from Style-Diversified Queries
arXiv 2023
Position Embedding Needs an Independent Layer Normalization
arXiv 2022
Affiliations
Frequent co-authors
10from 14 papers