Cite
Notes
Only stored in your browser.
Attribution
LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?
arXiv 2025
SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially?
FrontierCS: Evolving Challenges for Evolving Intelligence
from 3 papers
Jianzhu Yao
Pramod Viswanath
Aleksandra Korolova
Jingbo Shang
Kaiyuan Liu
Peter Henderson
Saining Xie
Shang Zhou
Wenhao Chai
Zeyu Shen