PlanarBench: Evaluating LLM Spatial Reasoning via Planar Graph Drawing

Open

Preview
Year: 2026
ArXiv: arxiv.org/abs/2606.02010
URL: arxiv.org/abs/2606.02010
Hosting: Full text hostedCC-BY-SA-4.0

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text: arxiv.org/abs/2606.02010CC-BY-SA-4.0
TL;DR: Semantic Scholar

Attribution policy →

Abstract

PlanarBench tests whether LLMs can draw planar graphs as ASCII art given only an edge list -- a spatial reasoning task that resists memorization because edge order, edge orientation, and node labels are all permutable. We evaluate 91 models on the 199 simplest non-isomorphic connected planar graphs (2 - 7 vertices). Edge count is the dominant difficulty predictor ($r = -0.85$) -- a finding not reported in prior LLM graph benchmarks, which use only node count as the difficulty axis.