Pytorch實作系列 — Neural Doodle

4 min readAug 9, 2021

Neural doodle 由 Alex J. Champandard (2016, nucl.ai)在論文 Semantic Style Transfer and Turning Two-Bit Doodles into Fine Artworks 提出，主要用途是透過patch-based style transfer將風格遷移到使用者繪製的模板。

如果不了解風格遷移的讀者，可參考

Pytorch實作系列 —Neural style transfer

圖像風格遷移由Gatys et al.(2016)提出，是最早使圖片能變換成另一張圖片風格的深度學習技術之一。

acrocanthosaurus627.medium.com

網路架構

相較於原本的架構，相當於多了一個附屬網路處理模板(mask)和風格及內容的特徵。

訓練

風格遷移有很多不同的實現方式，其中Gram-based 和 patch-based兩個方法為主，這兩種都不需要額外的資料作訓練。

Gram-based的方法使用Gram matrix計算風格損失，而patch-based方法採用最近鄰的patch計算風格損失，這類方法也被稱為MRF prior(馬可夫隨機場先驗)。

patch-based 方法需要使用多尺度訓練來產生較佳的結果，這麼做相當於讓模型從小畫布開始繪圖，當模型覺得已經畫得很好時，再給它一張更大的畫布

比起一開始就用高解析度繪圖提供更好的起始點
擴大的畫布可以進一步加強較小畫布的結果

實作方式如同論文擴大畫布的方法是上採樣2倍，可以試試resize說不定不會如本文得到預料之外的結果。

內容與風格

評估

沒有muliscale: 勉強有些樣子，但圖片都是噪音。樹木剛好有五根。

加入muliscale: 你不要看不起原稿就超展開呀~~

代碼連結

GitHub - gitE0Z9/classical-network-series

Contribute to gitE0Z9/classical-network-series development by creating an account on GitHub.

github.com

參考

GitHub - paulwarkentin/pytorch-neural-doodle: An implementation of the Semantic Style Transfer in…

The aim of this project is to implement a generative neural network to turn doodles into fine artworks. This is done by…

github.com

Semantic Style Transfer and Turning Two-Bit Doodles into Fine Artworks

Convolutional neural networks (CNNs) have proven highly effective at image synthesis and style transfer. For most…

arxiv.org

Combining Markov Random Fields and Convolutional Neural Networks for Image Synthesis

This paper studies a combination of generative Markov random field (MRF) models and discriminatively trained deep…