0

Yao Lu

Papers
26

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
26papers

Authored papers

26

StreamingVLM: Real-Time Understanding for Infinite Video Streams

arXiv 2025

2026

Scaling RL to Long Videos

arXiv 2025

2025

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

arXiv 2025

2025

SepPrune: Structured Pruning for Efficient Deep Speech Separation

arXiv 2025

2025

VLASH: Real-Time VLAs via Future-State-Aware Asynchronous Inference

arXiv 2025

2025

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

arXiv 2025

2025

DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer

arXiv 2025

2025

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

arXiv 2025

2025

Scaling Vision Pre-Training to 4K Resolution

CVPR 2025 1

2025

Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation

arXiv 2025

2025

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

arXiv 2024

2024

Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

arXiv 2024

2024

NVILA: Efficient Frontier Visual Language Models

CVPR 2025 1

2024

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

arXiv 2024

2024

RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models

arXiv 2024

2024

VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge

CVPR 2025 1

2024

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

arXiv 2024

2024

Reassessing Layer Pruning in LLMs: New Insights and Methods

arXiv 2024

2024

UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis

arXiv 2024

2024

Wolf: Captioning Everything with a World Summarization Framework

arXiv 2024

2024

VILA: On Pre-training for Visual Language Models

CVPR 2024 1

2023

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

arXiv 2023

2023

Strings from the Library of Babel: Random Sampling as a Strong Baseline for Prompt Optimisation

arXiv 2023

2023

RT-1: Robotics Transformer for Real-World Control at Scale

arXiv 2022

2022

Learning to Estimate Hidden Motions with Global Motion Aggregation

ICCV 2021 10

2021

Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific Articles

EMNLP 2020 11

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 26 papers