Ruochen Zhang
- Papers: 7
Authored papers (7)
Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability
arXiv 2025
Crosslingual Reasoning through Test-Time Scaling
arXiv 2025
Beyond Contrastive Learning: Synthetic Data Enables List-wise Training with Multiple Levels of Relevance
arXiv 2025
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
arXiv 2025
MINERS: Multilingual Language Models as Semantic Retrievers
arXiv 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
arXiv 2024
LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis
arXiv 2021
Frequent co-authors
10 from 7 papers
Genta Indra Winata
Alham Fikri Aji
Jan Christian Blaise Cruz
Niklas Muennighoff
grad-student
Zheng Xin Yong
researcher
Ayu Purwarianti
Bin Wang
Börje F. Karlsson
Carsten Eickhoff
Dan John Velasco