Tianyu Gao

Papers: 16

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

16papers

Authored papers

Metadata Conditioning Accelerates Language Model Pre-training

arXiv 2025

2025

HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly

arXiv 2024

2024

How to Train Long-Context Language Models (Effectively)

arXiv 2024

2024

Long-Context Language Modeling with Parallel Context Encoding

arXiv 2024

2024

LitSearch: A Retrieval Benchmark for Scientific Literature Search

arXiv 2024

2024

Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

arXiv 2023

2023

Fine-Tuning Language Models with Just Forward Passes

fine-tuning-language-models-with-just-forward

2023

Enabling Large Language Models to Generate Text with Citations

arXiv 2023

2023

Evaluating Large Language Models at Evaluating Instruction Following

arXiv 2023

2023

What In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task Learning

arXiv 2023

2023

Should You Mask 15% in Masked Language Modeling?

arXiv 2022

2022

SimCSE: Simple Contrastive Learning of Sentence Embeddings

EMNLP 2021 11

2021

Making Pre-trained Language Models Better Few-shot Learners

ACL 2021 5

2020

OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction

opennre-an-open-and-extensible-toolkit-for-1

2019

FewRel 2.0: Towards More Challenging Few-Shot Relation Classification

fewrel-20-towards-more-challenging-few-shot-1

2019

KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation

arXiv 2019

2019

Affiliations

No known affiliations.

Frequent co-authors

from 16 papers

Danqi Chen

professor

13 shared papers

Howard Yen

4 shared papers

Alexander Wettig

researcher

3 shared papers

Zhiyuan Liu

professor

3 shared papers

Jiatong Yu

2 shared papers

Maosong Sun

professor

Mengzhou Xia

Sadhika Malladi

Tanya Goyal

Xu Han