Leandro von Werra
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9SmolVLM: Redefining small and efficient multimodal models
arXiv 2025
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
arXiv 2025
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution
arXiv 2025
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models
arXiv 2024
SelfCodeAlign: Self-Alignment for Code Generation
arXiv 2024
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations
arXiv 2024
SantaCoder: don't reach for the stars!
arXiv 2023
OctoPack: Instruction Tuning Code Large Language Models
arXiv 2023
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements
arXiv 2022
Affiliations
Frequent co-authors
10from 9 papers
Qian Liu
Terry Yue Zhuo
Harm de Vries
researcher
Loubna Ben allal
Niklas Muennighoff
grad-student
Thomas Wolf
chief-science-officer
Arjun Guha
Armel Zebaze
Binyuan Hui
Elie Bakouch