ORIGINAL RESEARCH article
Front. Neurol.
Sec. Pediatric Neurology
Minimum Data Requirements and Automated Preprocessing for Reliable EEG Biomarkers in Rett Syndrome
- YO
Yongtaek Oh 1
- KC
Kathleen Campbell 1
- JS
Justine Shults 2
- JS
Joni Saby 1
- EM
Eric Marsh 1,2
1. The Children's Hospital of Philadelphia, Philadelphia, United States
2. University of Pennsylvania Perelman School of Medicine, Philadelphia, United States
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Abstract
Background: Electroencephalography (EEG) is a promising biomarker for Rett syndrome (RTT), but excessive artifact and variable tolerance for longer recording sessions pose challenges for reliable biomarker development. Establishing an automated preprocessing pipeline that matches human review and defining the minimum data needed for stable quantitative EEG (qEEG) features can support more patient-friendly protocols and provide consistent multisite analysis results. Methods: A mean of 10 minutes of resting-state EEG from 117 participants (1-18 year old; 236 sessions) in the multisite R61 RTT study was processed using a fully automated, correction-based preprocessing pipeline incorporating artifact handling, adaptive channel rejection, ASR, and ICA-based cleaning. Spectral power was extracted from artifact-free 4-second epochs. The proposed pipeline is validated using an established rejection-based pipeline. Feature stability as a function of cumulative data length was then assessed using two complementary frameworks: a Statistical Convergence approach and a Model-Based Inflection approach, and potential systemic dependencies were evaluated using permutation analyses. The relationship between clinical measures was also assessed. Results: The correction-based pipeline retained substantially more data than the rejection-based workflow (mean retention = 95.0% vs 28.4%; p < 0.001) while preserving strong feature correspondence across frequency bands. Stable power estimates were achieved after 19-34 epochs (≈ 76–136 seconds). Based on permutation analysis, there was no statistically significant difference in minimum stabilization threshold between RTT and TD. However, the RTT group exhibited higher rates of intrinsic signal instability than typically developing (TD) controls. Age-stratified analysis revealed that the minimum epochs did not significantly differ between age groups. Spectral associations with clinical severity were preserved when using only the minimum data required for stability, as well as in an ecologically valid scenario of truncating the raw EEG up to minimum epoch recommendation and reprocessing it. Conclusions: With the proposed correction-based pipeline, approximately 3 minutes of raw resting-state EEG are sufficient to obtain stable and clinically meaningful spectral features in children with Rett Syndrome. These findings support shorter, more feasible EEG acquisitions and provide a reproducible framework for data sufficiency in multisite neurodevelopmental studies.
Summary
Keywords
automated preprocessing, biomarkers, Data sufficiency, EEG, Rett Syndrome
Received
20 January 2026
Accepted
22 May 2026
Copyright
© 2026 Oh, Campbell, Shults, Saby and Marsh. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Eric Marsh
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.