ORIGINAL RESEARCH article

Front. Neurol.

Sec. Pediatric Neurology

Minimum Data Requirements and Automated Preprocessing for Reliable EEG Biomarkers in Rett Syndrome

  • 1. The Children's Hospital of Philadelphia, Philadelphia, United States

  • 2. University of Pennsylvania Perelman School of Medicine, Philadelphia, United States

The final, formatted version of the article will be published soon.

Abstract

Background: Electroencephalography (EEG) is a promising biomarker for Rett syndrome (RTT), but excessive artifact and variable tolerance for longer recording sessions pose challenges for reliable biomarker development. Establishing an automated preprocessing pipeline that matches human review and defining the minimum data needed for stable quantitative EEG (qEEG) features can support more patient-friendly protocols and provide consistent multisite analysis results. Methods: A mean of 10 minutes of resting-state EEG from 117 participants (1-18 year old; 236 sessions) in the multisite R61 RTT study was processed using a fully automated, correction-based preprocessing pipeline incorporating artifact handling, adaptive channel rejection, ASR, and ICA-based cleaning. Spectral power was extracted from artifact-free 4-second epochs. The proposed pipeline is validated using an established rejection-based pipeline. Feature stability as a function of cumulative data length was then assessed using two complementary frameworks: a Statistical Convergence approach and a Model-Based Inflection approach, and potential systemic dependencies were evaluated using permutation analyses. The relationship between clinical measures was also assessed. Results: The correction-based pipeline retained substantially more data than the rejection-based workflow (mean retention = 95.0% vs 28.4%; p < 0.001) while preserving strong feature correspondence across frequency bands. Stable power estimates were achieved after 19-34 epochs (≈ 76–136 seconds). Based on permutation analysis, there was no statistically significant difference in minimum stabilization threshold between RTT and TD. However, the RTT group exhibited higher rates of intrinsic signal instability than typically developing (TD) controls. Age-stratified analysis revealed that the minimum epochs did not significantly differ between age groups. Spectral associations with clinical severity were preserved when using only the minimum data required for stability, as well as in an ecologically valid scenario of truncating the raw EEG up to minimum epoch recommendation and reprocessing it. Conclusions: With the proposed correction-based pipeline, approximately 3 minutes of raw resting-state EEG are sufficient to obtain stable and clinically meaningful spectral features in children with Rett Syndrome. These findings support shorter, more feasible EEG acquisitions and provide a reproducible framework for data sufficiency in multisite neurodevelopmental studies.

Summary

Keywords

automated preprocessing, biomarkers, Data sufficiency, EEG, Rett Syndrome

Received

20 January 2026

Accepted

22 May 2026

Copyright

© 2026 Oh, Campbell, Shults, Saby and Marsh. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Eric Marsh

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Share article

Article metrics