Rapid Content-aware Image Style Transfer Using Attention Map Guidance and Diffusion Model

dc.contributor.author: Hwang, Jungmin
dc.contributor.supervisor: Lee, Wonsook
dc.date.accessioned: 2024-06-25T19:29:27Z
dc.date.available: 2024-06-25T19:29:27Z
dc.date.issued: 2024-06-25
dc.description.abstract: Despite considerable interest and progress in image generation and editing with diffusion models, balancing style transfer against content preservation remains a critical challenge. Addressing it is crucial to the usability of diffusion-based image editing tools, whether text-driven or image-driven. One way to tackle this challenge is to identify the areas of a given image that carry a high level of content and to preserve those areas, which contain more information than the rest of the image. To this end, we propose a method that anchors representative points in these areas for both the source image and the generated image. A self-attention mechanism selects queries and produces features at these anchor points, and we then apply contrastive learning in a self-supervised manner. This enables our method to generate an image that keeps the important content of the source image while transferring the style. Our method uses a conventional diffusion model and needs no fine-tuning or auxiliary network for content preservation. Because fine-tuning normally requires an additional network, our method speeds up inference compared to other diffusion methods. Our experiments show that our approach outperforms both other diffusion models and GAN-based models, particularly in preserving image content during editing.
dc.identifier.uri: http://hdl.handle.net/10393/46359
dc.identifier.uri: https://doi.org/10.20381/ruor-30416
dc.language.iso: en
dc.publisher: Université d'Ottawa | University of Ottawa
dc.rights: Attribution 4.0 International
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/
dc.subject: Style Transfer
dc.subject: Diffusion Model
dc.subject: Attention Map
dc.subject: Contrastive Learning
dc.title: Rapid Content-aware Image Style Transfer Using Attention Map Guidance and Diffusion Model
dc.type: Thesis
thesis.degree.discipline: Génie / Engineering
thesis.degree.level: Masters
thesis.degree.name: MASc
uottawa.department: Science informatique et génie électrique / Electrical Engineering and Computer Science

Files

Original bundle

Name: Hwang_Jungmin_2024_Thesis.pdf
Size: 6.54 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 6.65 KB
Format: Item-specific license agreed upon to submission
Description: