Tianyu Gao
- Papers: 16
Authored papers (16)
Metadata Conditioning Accelerates Language Model Pre-training
arXiv 2025
HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
arXiv 2024
How to Train Long-Context Language Models (Effectively)
arXiv 2024
Long-Context Language Modeling with Parallel Context Encoding
arXiv 2024
LitSearch: A Retrieval Benchmark for Scientific Literature Search
arXiv 2024
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
arXiv 2023
Fine-Tuning Language Models with Just Forward Passes
Enabling Large Language Models to Generate Text with Citations
arXiv 2023
Evaluating Large Language Models at Evaluating Instruction Following
arXiv 2023
What In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task Learning
arXiv 2023
Should You Mask 15% in Masked Language Modeling?
arXiv 2022
SimCSE: Simple Contrastive Learning of Sentence Embeddings
EMNLP 2021
Making Pre-trained Language Models Better Few-shot Learners
ACL 2021
FewRel 2.0: Towards More Challenging Few-Shot Relation Classification
KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation
arXiv 2019
OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction
Frequent co-authors
10 from 16 papers
Danqi Chen (professor)
Howard Yen
Alexander Wettig (researcher)
Zhiyuan Liu (professor)
Jiatong Yu
Maosong Sun (professor)
Mengzhou Xia
Sadhika Malladi
Tanya Goyal
Xu Han