multi turn dialog
- Slug: multi-turn-dialog
- Evals: 7
- Tools: 27
- Models: 322
- Papers: 5
Evals testing this capability: 7
Tools lifting evals here: 27
Top models on this capability: 322 (by avg parsed score across evals here)
Papers in this area: 5
- introduces: ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
- introduces: GAIA: A Benchmark for General AI Assistants
- introduces: Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
- introduces: τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains
- introduces: TextArena: Multi-Agent Text-Based Games for LLM Evaluation



