Attribution
MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs
arXiv 2025
from 1 paper
Chen Xing
Dean Lee
Ed-Yeremai Cardona
Johannes Mols
Kaustubh Deshpande
Lifeng Jin
Summer Yue
researcher
Ved Sirdeshmukh
Willow Primack