On Enhancing Crack Semantic Segmentation using StyleGAN and Brownian Bridge Diffusion

IEEE access, 2024-01, Vol.12, p.1-1

2024

Details

Author(s) / Contributors

Title

On Enhancing Crack Semantic Segmentation using StyleGAN and Brownian Bridge Diffusion

Is part of

IEEE access, 2024-01, Vol.12, p.1-1

Place / Publisher

Piscataway: IEEE

Year of publication

2024

Link to full text

Source

EZB Electronic Journals Library

Descriptions/Notes

Inspection for cracks is an essential yet labor-intensive aspect of maintaining structures in active service, such as bridges. Deep learning networks, combined with abundant segmented image data covering various types of cracks, enable computer-vision-based solutions. Segmentation data, however, is often scarce and time-consuming to annotate. This paper introduces a novel approach to structural crack detection using synthetic data generation and advanced semantic segmentation models. We employ StyleGAN3 and the Brownian Bridge Diffusion Model (BBDM) to create a diverse and realistic dataset of synthetic structural crack images, addressing the critical challenge of obtaining segmentation data for training deep learning crack-detection models. Our methodology builds on DeepLabv3+, a semantic segmentation architecture that extends DeepLabv3 with a simple yet effective decoder module to refine segmentation results. Because the original DeepLabv3+ model is insufficient on its own, we first perform meticulous hyperparameter tuning, which accounts for about a 10% increase in overall performance. Next, we generate multiple synthetic datasets through BBDM image-to-image translation and pair them with a carefully selected set of data augmentation techniques, including motion, zoom, and defocus blur, to improve crack segmentation performance. Compared with the latest state-of-the-art work on the same database, which achieved an accuracy of 61.49%, our approach attains a Mean Intersection over Union (MeanIoU) of 65.62% through ensemble modeling on multiple synthesized datasets with a majority voting strategy. We also showcase the potential of diffusion-model-generated synthetic datasets to raise semantic segmentation accuracy and introduce blur augmentation as a viable technique for enhancing model robustness.
The results indicate that our approach not only surpasses conventional methods in terms of MeanIoU but also offers a new avenue of research into diffusion-model-based synthetic image generation for improved semantic segmentation performance.
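
The abstract names two ingredients that are easy to illustrate: majority voting across an ensemble of segmentation models, and the MeanIoU metric. The following is a minimal sketch only, not the authors' implementation. It assumes binary masks (1 = crack, 0 = background) stored as nested lists, sends ties to background, and averages IoU over the classes that actually occur; the function names `majority_vote` and `mean_iou` are hypothetical, and this record does not describe the paper's actual pipeline.

```python
def majority_vote(masks):
    """Fuse binary masks pixel by pixel (assumption: 1 = crack, 0 = background).

    A pixel is labeled crack only if strictly more than half of the
    masks vote for it, so ties fall back to background.
    """
    n_models = len(masks)
    height, width = len(masks[0]), len(masks[0][0])
    fused = []
    for r in range(height):
        row = []
        for c in range(width):
            votes = sum(mask[r][c] for mask in masks)
            row.append(1 if 2 * votes > n_models else 0)
        fused.append(row)
    return fused


def mean_iou(pred, target, num_classes=2):
    """Average per-class intersection over union.

    Classes that appear in neither mask are skipped (an assumption of
    this sketch) so they do not distort the mean.
    """
    ious = []
    for cls in range(num_classes):
        intersection = 0
        union = 0
        for pred_row, target_row in zip(pred, target):
            for p, t in zip(pred_row, target_row):
                intersection += int(p == cls and t == cls)
                union += int(p == cls or t == cls)
        if union > 0:
            ious.append(intersection / union)
    return sum(ious) / len(ious)


# Toy 2x2 predictions from three hypothetical models.
model_a = [[1, 0], [1, 1]]
model_b = [[1, 1], [0, 1]]
model_c = [[0, 0], [1, 0]]
ground_truth = [[1, 0], [0, 1]]

fused = majority_vote([model_a, model_b, model_c])
score = mean_iou(fused, ground_truth)
print(fused, round(score, 4))
```

With an odd number of models, ties cannot occur for binary masks, so the tie rule matters only for even-sized ensembles.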

Language: English
Identifiers: eISSN: 2169-3536
DOI: 10.1109/ACCESS.2024.3368376
Title ID: cdi_proquest_journals_2947823185
