Attribution
Dynamic data sampler for cross-language transfer learning in large language models (arXiv 2024)
Weight-Inherited Distillation for Task-Agnostic BERT Compression (arXiv 2023)
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities (arXiv 2022)
from 3 papers
Zhe Zhao
Linlin Shen
Taiqiang Wu
Yudong Li
Chen Chen
Feifei Li
Han Guo
Haoyan Liu
Jiayi Li
Jing Zhao