Towards Automated Neonatal EEG Analysis: Multi-Center Validation of a Reliable Deep Learning Pipeline
Hermans, T.; Dereymaeker, A.; Lemmens, K.; Jansen, K.; Usman, F.; Robinson, S.; Naulaers, G.; De Vos, M.; Hartley, C.
Show abstract
ObjectiveTo evaluate the reliability and generalization of NeoNaid, a fully automated software tool for neonatal EEG analysis, based on functional brain age (FBA) estimation and sleep staging. MethodsNeoNaid combines a multi-task deep learning model with proposed quality control routines detecting artefacts, out-of-distribution inputs, and uncertain predictions. Based on a raw EEG input, it outputs one global FBA estimate and a continuous 2-state hypnogram. We validated performance on an two independent hospital settings: an internal dataset (33 EEGs, 17 infants, median 900 minutes/recording) and an external dataset (38 EEGs, 24 infants, median 124 minutes/recording). ResultsQuality control rejected comparable number of segments in the internal and external datasets, reducing extreme errors in FBA estimation, and modestly improving sleep staging accuracy. Across the internal and external data, NeoNaid achieved median absolute FBA errors of 0.50 and 0.55 weeks and Cohens Kappa values of 0.89 and 0.87 for quiet sleep detection, respectively. ConclusionsNeoNaid demonstrated improved reliability through integrated quality control and robust generalization across recording setups. SignificanceBy focusing on validation and trustworthiness, this work takes an essential step toward clinical adoption of automated neonatal EEG analysis and supports its utility for both NICU practice and large-scale research.
Matching journals
The top 5 journals account for 50% of the predicted probability mass.