Dong Li
- Papers: 24
Authored papers
AgentKernelArena: Generalization-Aware Benchmarking of GPU Kernel Optimization Agents
arXiv 2026
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE
arXiv 2025
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
arXiv 2025
A Survey of Reinforcement Learning for Large Reasoning Models
arXiv 2025
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
arXiv 2025
AMD-Hummingbird: Towards an Efficient Text-to-Video Model
arXiv 2025
Step-GUI Technical Report
arXiv 2025
Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning
arXiv 2025
PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation
arXiv 2025
Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding
arXiv 2025
Enhancing Financial Time-Series Forecasting with Retrieval-Augmented Large Language Models
arXiv 2025
Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback
arXiv 2024
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
arXiv 2024
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention
arXiv 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
arXiv 2024
Scaling Laws for Linear Complexity Language Models
arXiv 2024
CO2: Efficient Distributed Training with Full Communication-Computation Overlap
arXiv 2024
Linear Attention Sequence Parallelism
arXiv 2024
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
arXiv 2024
UMAD: University of Macau Anomaly Detection Benchmark Dataset
arXiv 2024
EGSRAL: An Enhanced 3D Gaussian Splatting based Renderer with Automated Labeling for Large-Scale Driving Scene
arXiv 2024
ReNeg: Learning Negative Embedding with Reward Guidance
CVPR 2025
Fine-grained Audible Video Description
CVPR 2023
HarmonyDream: Task Harmonization Inside World Models
arXiv 2023
Frequent co-authors
10 from 24 papers