Yiran Zhong
- Papers: 14
Authored papers
FlashSampling: Fast and Memory-Efficient Exact Sampling
arXiv 2026
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
arXiv 2025
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme
arXiv 2025
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
arXiv 2024
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention
arXiv 2024
Linear Attention Sequence Parallelism
arXiv 2024
TAVGBench: Benchmarking Text to Audible-Video Generation
arXiv 2024
Scaling Laws for Linear Complexity Language Models
arXiv 2024
CO2: Efficient Distributed Training with Full Communication-Computation Overlap
arXiv 2024
Audio-Visual Segmentation with Semantics
arXiv 2023
Fine-grained Audible Video Description
CVPR 2023 1
Accelerating Toeplitz Neural Network with Constant-time Inference Complexity
arXiv 2023
cosFormer: Rethinking Softmax in Attention
The Devil in Linear Transformer
arXiv 2022
Frequent co-authors
10 from 14 papers