Mao Yang
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset
arXiv 2025
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
arXiv 2025
RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference
arXiv 2025
rStar2-Agent: Agentic Reasoning Technical Report
arXiv 2025
LongRoPE2: Near-Lossless LLM Context Window Scaling
arXiv 2025
Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions
arXiv 2025
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge
arXiv 2024
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
arXiv 2024
MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
arXiv 2024
Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models
arXiv 2023
Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference
arXiv 2023
IRGen: Generative Modeling for Image Retrieval
arXiv 2023
Tutel: Adaptive Mixture-of-Experts at Scale
arXiv 2022
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search
spann-highly-efficient-billion-scale-1
WRENCH: A Comprehensive Benchmark for Weak Supervision
arXiv 2021
Affiliations
Frequent co-authors
10from 15 papers