Lester James V. Miranda
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12Polyglot Teachers: Evaluating Language Models for Multilingual Synthetic Data Generation
arXiv 2026
Multilinguality at the Edge: Developing Language Models for the Global South
arXiv 2026
Olmo 3
arXiv 2025
R3: Robust Rubric-Agnostic Reward Models
arXiv 2025
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
arXiv 2025
The UD-NewsCrawl Treebank: Reflections and Challenges from a Large-scale Tagalog Syntactic Annotation Project
arXiv 2025
2 OLMo 2 Furious
arXiv 2024
Tulu 3: Pushing Frontiers in Open Language Model Post-Training
preprint
M-RewardBench: Evaluating Reward Models in Multilingual Settings
arXiv 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
arXiv 2024
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback
arXiv 2024
calamanCy: A Tagalog Natural Language Processing Toolkit
arXiv 2023
Affiliations
Frequent co-authors
10from 12 papers
Faeze Brahman
researcher
Hannaneh Hajishirzi
professor
Nathan Lambert
researcher
Noah A. Smith
Pradeep Dasigi
Valentina Pyatkin
research-scientist
Genta Indra Winata
Hamish Ivison
grad-student
Jacob Morrison
research-engineer
Luca Soldaini