0

Using Style Ambiguity Loss to Improve Aesthetics of Diffusion Models

Style ambiguity loss improves the creativity and image quality of diffusion models and GANs without the need for a classifier or labeled dataset.

Year
2024
Venue
arXiv 2024
Authors
1
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2410.02055ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

Teaching text-to-image models to be creative involves using style ambiguity loss. In this work, we explore using the style ambiguity training objective, used to approximate creativity, on a diffusion model. We then experiment with forms of style ambiguity loss that do not require training a classifier or a labeled dataset, and find that the models trained with style ambiguity loss can generate better images than the baseline diffusion models and GANs. Code is available at https://github.com/jamesBaker361/clipcreate.

Authors

1