Jiawei Zhao
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation
arXiv 2026
AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs
arXiv 2026
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading
arXiv 2025
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
arXiv 2024
Mini-Sequence Transformer: Optimizing Intermediate Memory for Long Sequences Training
arXiv 2024
From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients
arXiv 2024
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients
arXiv 2024
ZerO Initialization: Initializing Neural Networks with only Zeros and Ones
zero-initialization-initializing-residual
Affiliations
Frequent co-authors
10from 8 papers