Attribution
Seed1.5-VL Technical Report (arXiv 2025)
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs (arXiv 2024)
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs (arXiv 2022)
from 3 papers
Xin Liu
Haibin Lin
Liang Xiang
Shipeng Yan
Yanghua Peng
Yangrui Chen
Aoxue Zhang
Bairen Yi
Bencheng Liao
Can Huang