Daniel Balcells

Papers: 1

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

1

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

arXiv 2025

Affiliations

No known affiliations.

Frequent co-authors

10, from 1 paper

Bo Liu

researcher

Cheston Tan

researcher

Leon Guertler

researcher

Mickel Liu

researcher

Min Lin

Natasha Jaques

Penghui Qi

Simon Yu

researcher

Wee Sun Lee

Weiyan Shi