Xuezhe Ma

Papers: 9

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

9papers

Authored papers

Gecko: An Efficient Neural Architecture Inherently Processing Sequences with Arbitrary Lengths

arXiv 2026

2026

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

arXiv 2024

2024

DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training

arXiv 2023

2023

Evaluating Large Language Models on Controlled Generation Tasks

arXiv 2023

2023

Look-back Decoding for Open-Ended Text Generation

arXiv 2023

2023

Mega: Moving Average Equipped Gated Attention

arXiv 2022

2022

Improving Stability of Fine-Tuning Pretrained Language Models via Component-Wise Gradient Norm Clipping

arXiv 2022

2022

Better May Not Be Fairer: A Study on Subgroup Discrepancy in Image Classification

ICCV 2023 1

2022

Towards a Unified View of Parameter-Efficient Transfer Learning

towards-a-unified-view-of-parameter-efficient

2021

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Chunting Zhou

4 shared papers

Hao Zhang

professor

3 shared papers

Jonathan May

3 shared papers

Luke Zettlemoyer

professor

3 shared papers

Graham Neubig

professor

Junxian He

Nan Xu

Anze Xie

Asli Celikyilmaz

Beidi Chen