Tao Ge

Papers: 17

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

17papers

Authored papers

Orchard: An Open-Source Agentic Modeling Framework

arXiv 2026

2026

Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding

arXiv 2024

2024

Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers

arXiv 2024

2024

xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token

arXiv 2024

2024

Scaling Synthetic Data Creation with 1,000,000,000 Personas

arXiv 2024

2024

Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning

arXiv 2024

2024

Inference with Reference: Lossless Acceleration of Large Language Models

arXiv 2023

2023

In-context Autoencoder for Context Compression in a Large Language Model

arXiv 2023

2023

Smart Word Suggestions for Writing Assistance

arXiv 2023

2023

SCALE: Synergized Collaboration of Asymmetric Language Translation Engines

arXiv 2023

2023

Low-code LLM: Graphical User Interface over Large Language Models

arXiv 2023

2023

Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration

arXiv 2023

2023

ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents

arXiv 2023

2023

Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation

arXiv 2022

2022

Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting

EMNLP 2021 11

2021

BERT-of-Theseus: Compressing BERT by Progressive Module Replacing

EMNLP 2020 11

2020

BERT Loses Patience: Fast and Robust Inference with Early Exit

NeurIPS 2020 12

2020

Affiliations

No known affiliations.

Frequent co-authors

from 17 papers

Furu Wei

Xun Wang

Shaoguang Mao

Si-Qing Chen

Wenshan Wu

Canwen Xu

Dong Yu

Dongyan Zhao

Wangchunshu Zhou

Yan Xia