0

Mitigating Long-Tail Bias via Prompt-Controlled Diffusion Augmentation

A prompt-controlled diffusion augmentation framework generates synthetic remote-sensing imagery with controlled domain and semantic composition to address long-tailed class imbalance and improve segmentation performance across Urban and Rural domains.

Year
2026
Venue
arXiv 2026
Authors
6
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2602.04749ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

Semantic segmentation of high-resolution remote-sensing imagery is critical for urban mapping and land-cover monitoring, yet training data typically exhibits severe long-tailed pixel imbalance. In the dataset LoveDA, this challenge is compounded by an explicit Urban/Rural split with distinct appearance and inconsistent class-frequency statistics across domains. We present a prompt-controlled diffusion augmentation framework that synthesizes paired label--image samples with explicit control of both domain and semantic composition. StageA uses a domain-aware, masked ratio-conditioned discrete diffusion model to generate layouts that satisfy user-specified class-ratio targets while respecting learned co-occurrence structure. StageB translates layouts into photorealistic, domain-consistent images using Stable Diffusion with ControlNet guidance. Mixing the resulting ratio and domain-controlled synthetic pairs with real data yields consistent improvements across multiple segmentation backbones, with gains concentrated on minority classes and improved Urban and Rural generalization, demonstrating controllable augmentation as a practical mechanism to mitigate long-tail bias in remote-sensing segmentation. Source codes, pretrained models, and synthetic datasets are available at https://github.com/Buddhi19/SyntheticGen.git{Github}

Authors

6