Reusing Computation in Text-to-Image Diffusion for Efficient Generation of Image Sets
Abstract: Text-to-image diffusion models enable high-quality image generation but are computationally expensive. While prior work optimizes per-inference efficiency, we explore an orthogonal approach: reducing redundancy across correlated prompts. Our method leverages the coarse-to-fine nature of diffusion models, where early denoising steps capture shared structures among similar prompts. We propose a training-free approach that clusters prompts based on semantic sim...
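The core idea described in the abstract, clustering semantically similar prompts and sharing the early (coarse) denoising trajectory within each cluster before branching into per-prompt (fine) denoising, can be sketched roughly as follows. This is an illustrative outline only, not the paper's implementation: the embedding model, clustering threshold, split step `t_split`, and the `denoise_step` / `init_latent` helpers are assumptions standing in for a real diffusion pipeline.

```python
# Illustrative sketch (not the paper's code): cluster prompts by embedding
# similarity, run the early denoising steps once per cluster, then branch
# into per-prompt denoising for the remaining steps.
from sentence_transformers import SentenceTransformer
from sklearn.cluster import AgglomerativeClustering


def cluster_prompts(prompts, distance_threshold=0.35):
    """Group prompts whose embeddings are close in cosine distance."""
    embedder = SentenceTransformer("all-MiniLM-L6-v2")
    emb = embedder.encode(prompts, normalize_embeddings=True)
    labels = AgglomerativeClustering(
        n_clusters=None,
        distance_threshold=distance_threshold,
        metric="cosine",
        linkage="average",
    ).fit_predict(emb)
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)
    return list(clusters.values())


def generate_image_set(prompts, total_steps=50, t_split=20,
                       denoise_step=None, init_latent=None):
    """Hypothetical driver: `denoise_step(latent, t, prompt)` and
    `init_latent()` stand in for a real diffusion sampler."""
    images = [None] * len(prompts)
    for cluster in cluster_prompts(prompts):
        # Shared coarse phase: one trajectory per cluster, conditioned on a
        # representative prompt (here, simply the first member).
        latent = init_latent()
        for t in range(total_steps, total_steps - t_split, -1):
            latent = denoise_step(latent, t, prompts[cluster[0]])
        # Per-prompt fine phase: branch the shared latent for each member.
        for idx in cluster:
            x = latent.copy()
            for t in range(total_steps - t_split, 0, -1):
                x = denoise_step(x, t, prompts[idx])
            images[idx] = x
    return images
```

Under this reading, the savings come from amortizing the first `t_split` denoising steps over every prompt in a cluster; how the split point and clustering threshold are chosen is detailed in the full paper.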