FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing
Paper i proceeding, 2025

Text-guided image editing using Text-to-Image (T2I) models often fails to yield satisfactory results, frequently introducing unintended modifications, such as the loss of local detail and color changes. In this paper, we analyze these failure cases and attribute them to the indiscriminate optimization across all frequency bands, even though only specific frequencies may require adjustment. To address this, we introduce a simple yet effective approach that enables the selective optimization of specific frequency bands within localized spatial regions for precise edits. Our method leverages wavelets to decompose images into different spatial resolutions across multiple frequency bands, enabling precise modifications at various levels of detail. To extend the applicability of our approach, we provide a comparative analysis of different frequency-domain techniques. Additionally, we extend our method to 3D texture editing by performing frequency decomposition on the triplane representation, enabling frequency-aware adjustments for 3D textures. Quantitative evaluations and user studies demonstrate the effectiveness of our method in producing high-quality and precise edits. Further details are available on our project website: https://ivrl.github.io/fds-webpage/

Text-to-Image

3D texture editing

Diffusion

Score Distillation Sampling

Författare

Yufan Ren

Ecole Polytechnique Federale de Lausanne (EPFL)

Zicong Jiang

Danmarks Tekniske Universitet (DTU)

Chalmers, Elektroteknik, Kommunikation, Antenner och Optiska Nätverk

Søren Forchhammer

Danmarks Tekniske Universitet (DTU)

Sabine Süsstrunk

Ecole Polytechnique Federale de Lausanne (EPFL)

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

10636919 (ISSN)


979-8-3315-4364-8 (ISBN)

2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Nashville, USA,

Styrkeområden

Informations- och kommunikationsteknik

Ämneskategorier (SSIF 2025)

Datorgrafik och datorseende

Datavetenskap (datalogi)

Artificiell intelligens

DOI

10.1109/CVPR52734.2025.00253

Mer information

Senast uppdaterat

2026-06-01