논문 제목 : Adaptive Classifier-Free Guidance for Robust Image-to-Image Translation
논문 요약 : Text-guided image-to-image translation aims to edit a source image according to a textual prompt while preserving its structure. However, existing approaches often rely on a fixed classifier-free guidance scale and a single prompt input, leading to unstable results and a poor balance between semantic fidelity and structural preservation. In this work, we propose a unified framework that improves both the controllability and stability of text-driven diffusion editing without requiring fine-tuning or paired training data. Our method introduces two key components: (1) an adaptive guidance scheduler that dynamically modulates the classifier-free guidance scale over timesteps based on the input image and prompt, and (2) a prompt ensemble mechanism that generates and ranks multiple semantically aligned prompt variants to mitigate prompt sensitivity. Together, these components form a plug-and-play framework that significantly improves editing consistency and visual quality. Extensive experiments on NuScenes, AFHQ, and CelebA demonstrate that our method consistently outperforms existing approaches across diverse scenarios.
학위연월 : 2026년 8월
E-mail: sonbongguk5@gmail.com
지도교수: 전상률 교수님
키워드: Computer Vision (CV), Diffusion Model, Text-guided image editing, Classifier-Free Guidance, Image Synthesis
학위논문 소개 웹페이지 URL:https://github.com/zespy5/AdaCFG