Cite
Notes
Only stored in your browser.
Attribution
FutureSim: Replaying World Events to Evaluate Adaptive Agents
arXiv 2026
Scaling Open-Ended Reasoning to Predict the Future
arXiv 2025
Answer Matching Outperforms Multiple Choice for Language Model Evaluation
from 3 papers
Ameya Prabhu
Jonas Geiping
Moritz Hardt
Shashwat Goel
Arvindh Arun
Maksym Andriushchenko
Steffen Staab