Zhewei Yao
- Papers: 16
Authored papers
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
arXiv 2026
Learning to Hint for Reinforcement Learning
arXiv 2026
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences
arXiv 2025
Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL
arXiv 2025
ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration
arXiv 2025
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
arXiv 2024
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention
arXiv 2023
I-BERT: Integer-only BERT Quantization
arXiv 2021
How Much Can CLIP Benefit Vision-and-Language Tasks?
arXiv 2021
Hessian-Aware Pruning and Optimal Neural Implant
arXiv 2021
ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning
arXiv 2020
HAWQV3: Dyadic Neural Network Quantization
arXiv 2020
ZeroQ: A Novel Zero Shot Quantization Framework
PowerNorm: Rethinking Batch Normalization in Transformers
ICML 2020
HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks
NeurIPS 2020
Frequent co-authors
10 from 16 papers