Yao Lu
- Papers
- 26
Cite
Notes
Only stored in your browser.
Authored papers
26StreamingVLM: Real-Time Understanding for Infinite Video Streams
arXiv 2025
Scaling RL to Long Videos
arXiv 2025
LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention
arXiv 2025
SepPrune: Structured Pruning for Efficient Deep Speech Separation
arXiv 2025
VLASH: Real-Time VLAs via Future-State-Aware Asynchronous Inference
arXiv 2025
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
arXiv 2025
DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer
arXiv 2025
Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
arXiv 2025
Scaling Vision Pre-Training to 4K Resolution
CVPR 2025 1
Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation
arXiv 2025
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
arXiv 2024
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
arXiv 2024
NVILA: Efficient Frontier Visual Language Models
CVPR 2025 1
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
arXiv 2024
RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models
arXiv 2024
VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
CVPR 2025 1
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
arXiv 2024
Reassessing Layer Pruning in LLMs: New Insights and Methods
arXiv 2024
UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis
arXiv 2024
Wolf: Captioning Everything with a World Summarization Framework
arXiv 2024
VILA: On Pre-training for Visual Language Models
CVPR 2024 1
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
arXiv 2023
Strings from the Library of Babel: Random Sampling as a Strong Baseline for Prompt Optimisation
arXiv 2023
RT-1: Robotics Transformer for Real-World Control at Scale
arXiv 2022
Learning to Estimate Hidden Motions with Global Motion Aggregation
ICCV 2021 10
Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific Articles
EMNLP 2020 11
Affiliations
Frequent co-authors
10from 26 papers