UB Paderborn / Katalog / Suche / Details

Zur Ergebnisliste

Ergebnis 4 von 62

Post-Training Quantization on Diffusion Models

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, p.1972-1981

2023

Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte

Titel

Post-Training Quantization on Diffusion Models

Ist Teil von

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, p.1972-1981

Ort / Verlag

IEEE

Erscheinungsjahr

2023

Quelle

IEEE Electronic Library Online

Beschreibungen/Notizen

Denoising diffusion (score-based) generative models have recently achieved significant accomplishments in generating realistic and diverse data. Unfortunately, the generation process of current denoising diffusion models is notoriously slow due to the lengthy iterative noise estimations, which rely on cumbersome neural networks. It prevents the diffusion models from being widely deployed, especially on edge devices. Previous works accelerate the generation process of diffusion model (DM) via finding shorter yet effective sampling trajectories. However, they overlook the cost of noise estimation with a heavy network in every iteration. In this work, we accelerate generation from the perspective of compressing the noise estimation network. Due to the difficulty of retraining DMs, we exclude mainstream training-aware compression paradigms and introduce post-training quantization (PTQ) into DM acceleration. However, the output distributions of noise estimation networks change with time-step, making previous PTQ methods fail in DMs since they are designed for single-time step scenarios. To devise a DM-specific PTQ method, we explore PTQ on DM in three aspects: quantized operations, calibration dataset, and calibration metric. We summarize and use several observations derived from all-inclusive investigations to formulate our method, which especially targets the unique multi-time-step structure of DMs. Experimentally, our method can directly quantize full-precision DMs into 8-bit models while maintaining or even improving their performance in a training-free manner. Importantly, our method can serve as a plug-and-play module on other fast-sampling methods, e.g., DDIM [24]. The code is available at https://https://github.com/42Shawn/PTQ4DM.

Sprache: Englisch
Identifikatoren: eISSN: 2575-7075
DOI: 10.1109/CVPR52729.2023.00196
Titel-ID: cdi_ieee_primary_10204009

Format: –
Schlagworte: Calibration, Computational modeling, Efficient and scalable vision, Estimation, Measurement, Neural networks, Noise reduction, Quantization (signal)

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX

Menü

Weitere Dienste

Einstellungen

Post-Training Quantization on Diffusion Models

Details

Weiterführende Literatur