IEEE Transactions on Neural Networks and Learning Systems, 2024-04, Vol.PP, p.1-10

Author(s) / Contributors

Title

Understanding Double Descent Using VC-Theoretical Framework

Is part of

IEEE Transactions on Neural Networks and Learning Systems, 2024-04, Vol.PP, p.1-10

Place / Publisher

United States: IEEE

Year of publication

2024

Source

IEEE/IET Electronic Library (IEL)

Descriptions/Notes

Despite many successful applications of deep learning (DL) networks, the theoretical understanding of their generalization capabilities and limitations remains incomplete. We present an analysis of the generalization performance of DL networks for classification under the VC-theoretical framework. In particular, we analyze the so-called "double descent" phenomenon, in which large overparameterized networks can generalize well even when they perfectly memorize all available training data. This appears to contradict the conventional statistical view that optimal model complexity should reflect an optimal balance between underfitting and overfitting, i.e., the bias-variance trade-off. We present a VC-theoretical explanation of the double descent phenomenon in the classification setting. Our theoretical explanation is supported by empirical modeling of double descent curves, using analytic VC-bounds, for several learning methods, such as support vector machine (SVM), least squares (LS), and multilayer perceptron classifiers. The proposed VC-theoretical approach enables a better understanding of overparameterized estimators during the second descent.

Language: English
Identifiers: ISSN: 2162-237X
eISSN: 2162-2388
DOI: 10.1109/TNNLS.2024.3388873
Title ID: cdi_ieee_primary_10508981

Format: –
Keywords: Complexity control, Complexity theory, Convergence, Data models, deep learning (DL), double descent, generalization bounds, networks with random weights, structural risk minimization (SRM), Support vector machines, Training, Training data, Upper bound, VC-dimension
