Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
IEEE transaction on neural networks and learning systems, 2024-04, Vol.PP, p.1-10
2024

Details

Autor(en) / Beteiligte
Titel
Understanding Double Descent Using VC-Theoretical Framework
Ist Teil von
  • IEEE transaction on neural networks and learning systems, 2024-04, Vol.PP, p.1-10
Ort / Verlag
United States: IEEE
Erscheinungsjahr
2024
Link zum Volltext
Quelle
IEEE/IET Electronic Library (IEL)
Beschreibungen/Notizen
  • In spite of many successful applications of deep learning (DL) networks, theoretical understanding of their generalization capabilities and limitations remains limited. We present analysis of generalization performance of DL networks for classification under VC-theoretical framework. In particular, we analyze the so-called "double descent" phenomenon, when large overparameterized networks can generalize well, even when they perfectly memorize all available training data. This appears to contradict conventional statistical view that optimal model complexity should reflect an optimal balance between underfitting and overfitting, i.e., the bias-variance trade-off. We present VC-theoretical explanation of double descent phenomenon, under classification setting. Our theoretical explanation is supported by empirical modeling of double descent curves, using analytic VC-bounds, for several learning methods, such as support vector machine (SVM), least squares (LS), and multilayer perceptron classifiers. The proposed VC-theoretical approach enables better understanding of overparameterized estimators during second descent.
Sprache
Englisch
Identifikatoren
ISSN: 2162-237X
eISSN: 2162-2388
DOI: 10.1109/TNNLS.2024.3388873
Titel-ID: cdi_ieee_primary_10508981

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX