Tao Ge
- Papers: 17
Authored papers
Orchard: An Open-Source Agentic Modeling Framework
arXiv 2026
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding
arXiv 2024
Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers
arXiv 2024
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
arXiv 2024
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning
arXiv 2024
Scaling Synthetic Data Creation with 1,000,000,000 Personas
arXiv 2024
Inference with Reference: Lossless Acceleration of Large Language Models
arXiv 2023
In-context Autoencoder for Context Compression in a Large Language Model
arXiv 2023
ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents
arXiv 2023
SCALE: Synergized Collaboration of Asymmetric Language Translation Engines
arXiv 2023
Low-code LLM: Graphical User Interface over Large Language Models
arXiv 2023
Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration
arXiv 2023
Smart Word Suggestions for Writing Assistance
arXiv 2023
Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation
arXiv 2022
Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
EMNLP 2021 11
BERT Loses Patience: Fast and Robust Inference with Early Exit
NeurIPS 2020 12
BERT-of-Theseus: Compressing BERT by Progressive Module Replacing
EMNLP 2020 11
Frequent co-authors
10 from 17 papers