TY - JOUR
T1 - Fitness landscape analysis of convolutional neural network architectures for image classification
AU - Rodrigues, Nuno M.
AU - Malan, Katherine M.
AU - Ochoa, Gabriela
AU - Vanneschi, Leonardo
AU - Silva, Sara
N1 - info:eu-repo/grantAgreement/FCT/3599-PPCDT/DSAIPA%2FDS%2F0113%2F2019/PT#
info:eu-repo/grantAgreement/FCT/3599-PPCDT/DSAIPA%2FDS%2F0022%2F2018/PT#
info:eu-repo/grantAgreement/FCT/3599-PPCDT/PTDC%2FCCI-INF%2F29168%2F2017/PT#
info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F04152%2F2020/PT#
Rodrigues, N. M., Malan, K. M., Ochoa, G., Vanneschi, L., & Silva, S. (2022). Fitness landscape analysis of convolutional neural network architectures for image classification. Information Sciences, 609(September), 711-726. https://doi.org/10.1016/j.ins.2022.07.040. ----- Funding: The work of K.M. Malan was supported by the National Research Foundation, South Africa, under Grant 120837. This work was partially supported by FCT, Portugal, through funding of the LASIGE Research Unit (UIDB/00408/2020 and UIDP/00408/2020); projects GADgET (DSAIPA/ DS/ 0022/ 2018), BINDER (PTDC/ CCI-INF/ 29168/ 2017), AICE (DSAIPA/ DS/ 0113/ 2019); Nuno Rodrigues was supported by PhD Grant 2021/05322/BD.
PY - 2022/9/1
Y1 - 2022/9/1
N2 - The global structure of the hyperparameter spaces of neural networks is not well understood and it is therefore not clear which hyperparameter search algorithm will be most effective. In this paper we analyze the landscapes of convolutional neural network architecture search spaces to provide insight into appropriate search algorithms for these spaces. Using a classical fitness landscape analysis approach (fitness distance correlation) and a more recent tool (local optima networks) we study the global structure of these spaces. Our analysis on six image classification datasets reveals that the landscapes are multi-modal, but with relatively few local optima from which it is not hard to escape with a simple perturbation operator. This led us to explore the performance of iterated local search, which we found to more effectively search the training landscapes than three evolutionary algorithm variants. Evolutionary algorithms, however, outperformed iterated local search in terms of generalization on problems with larger discrepancies between the training and testing landscapes.
AB - The global structure of the hyperparameter spaces of neural networks is not well understood and it is therefore not clear which hyperparameter search algorithm will be most effective. In this paper we analyze the landscapes of convolutional neural network architecture search spaces to provide insight into appropriate search algorithms for these spaces. Using a classical fitness landscape analysis approach (fitness distance correlation) and a more recent tool (local optima networks) we study the global structure of these spaces. Our analysis on six image classification datasets reveals that the landscapes are multi-modal, but with relatively few local optima from which it is not hard to escape with a simple perturbation operator. This led us to explore the performance of iterated local search, which we found to more effectively search the training landscapes than three evolutionary algorithm variants. Evolutionary algorithms, however, outperformed iterated local search in terms of generalization on problems with larger discrepancies between the training and testing landscapes.
KW - Neural architecture search
KW - Convolutional neural networks
KW - Fitness distance correlation
KW - Local optima networks
KW - Loss landscapes
UR - http://www.scopus.com/inward/record.url?scp=85134890844&partnerID=8YFLogxK
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:000848168800007
U2 - 10.1016/j.ins.2022.07.040
DO - 10.1016/j.ins.2022.07.040
M3 - Article
VL - 609
SP - 711
EP - 726
JO - Information Sciences
JF - Information Sciences
SN - 0020-0255
IS - September
ER -