0

AutoB2G: Agentic Simulation and Reinforcement Learning for Spatio-Temporal Grid-Interactive Building Control

Grid-interactive building control has emerged as a promising approach for improving demand-side flexibility in modern power systems. Realistic studies of such systems, however, require tightly coupled co-simulation across buildings, reinforcement learning (RL), and distribution…

Preview
Year
2026
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2603.26005ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

Grid-interactive building control has emerged as a promising approach for improving demand-side flexibility in modern power systems. Realistic studies of such systems, however, require tightly coupled co-simulation across buildings, reinforcement learning (RL), and distribution grids to capture time-varying control dynamics over spatially distributed grid infrastructures. Constructing these workflows remains highly challenging in practice: researchers must coordinate heterogeneous simulators, configure grid environments, synchronize time-varying execution, and maintain consistency across software interfaces and physical constraints. As simulation complexity increases, these requirements become a major bottleneck for rapidly prototyping and studying learning-based energy control systems. In this work, we introduce AutoB2G, an agentic framework for spatio-temporal building-grid co-simulation. AutoB2G formulates simulation construction as a workflow orchestration problem, where natural-language user intents are translated into executable simulation pipelines. The framework integrates building control environments with power-system simulation tools, enabling modular co-simulation under diverse grid settings. To automate workflow construction, we develop an agentic large language model (LLM)-based orchestration framework for scientific simulation. AutoB2G organizes simulation components into a directed acyclic graph (DAG)-structured codebase and employs LLM agents to perform retrieval, composition, execution, verification, and iterative repair of simulation workflows. This allows users to specify high-level simulation tasks while automatically generating complex co-simulation pipelines without manually implementing low-level simulator logic.