Cite
Notes
Only stored in your browser.
Attribution
MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
arXiv 2023
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT
beto-bentz-becas-the-surprising-cross-lingual-1
from 2 papers
Mark Dredze
David Rosenberg
Mohit Bansal
Ozan İrsoy
Shiyue Zhang
Steven Lu