Mengzhou Xia
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning
arXiv 2025
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving
arXiv 2025
MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning
arXiv 2025
CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
arXiv 2024
LESS: Selecting Influential Data for Targeted Instruction Tuning
arXiv 2024
SimPO: Simple Preference Optimization with a Reference-Free Reward
arXiv 2024
LitSearch: A Retrieval Benchmark for Scientific Literature Search
arXiv 2024
Language Models as Science Tutors
arXiv 2024
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
arXiv 2023
Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation
arXiv 2023
Structured Pruning Learns Compact and Accurate Models
ACL 2022 5
MABEL: Attenuating Gender Bias using Textual Entailment Data
arXiv 2022
Affiliations
Frequent co-authors
10from 12 papers