Zhenheng Tang
Authored papers (9)
CloneMem: Benchmarking Long-Term Memory for AI Clones
arXiv 2026
EpochX: Building the Infrastructure for an Emergent Agent Civilization
arXiv 2026
ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference
arXiv 2025
FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts Models
arXiv 2025
Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression
arXiv 2025
BurstGPT: A Real-world Workload Dataset to Optimize LLM Serving Systems
arXiv 2024
Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models
arXiv 2024
Should We Really Edit Language Models? On the Evaluation of Edited Language Models
arXiv 2024
NAS-LID: Efficient Neural Architecture Search with Local Intrinsic Dimension
arXiv 2022
Frequent co-authors: 10 (from 9 papers)