ACM transactions on graphics, 2021-07, Vol.40 (4), p.1-14, Article 133
2021

Details

Author(s) / Contributors
Title
Designing an encoder for StyleGAN image manipulation
Is part of
  • ACM transactions on graphics, 2021-07, Vol.40 (4), p.1-14, Article 133
Place / Publisher
New York, NY, USA: ACM
Year of publication
2021
Link to full text
Source
Alma/SFX Local Collection
Descriptions / Notes
  • Recently, there has been a surge of diverse methods for performing image editing by employing pre-trained unconditional generators. Applying these methods on real images, however, remains a challenge, as it necessarily requires the inversion of the images into their latent space. To successfully invert a real image, one needs to find a latent code that reconstructs the input image accurately, and more importantly, allows for its meaningful manipulation. In this paper, we carefully study the latent space of StyleGAN, the state-of-the-art unconditional generator. We identify and analyze the existence of a distortion-editability tradeoff and a distortion-perception tradeoff within the StyleGAN latent space. We then suggest two principles for designing encoders in a manner that allows one to control the proximity of the inversions to regions that StyleGAN was originally trained on. We present an encoder based on our two principles that is specifically designed for facilitating editing on real images by balancing these tradeoffs. By evaluating its performance qualitatively and quantitatively on numerous challenging domains, including cars and horses, we show that our inversion method, followed by common editing techniques, achieves superior real-image editing quality, with only a small reconstruction accuracy drop.
Language
English
Identifiers
ISSN: 0730-0301
eISSN: 1557-7368
DOI: 10.1145/3450626.3459838
Title ID: cdi_crossref_primary_10_1145_3450626_3459838
