Data Sheet 1_Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy.pdf

Background<p>Otitis media with effusion (OME) affects a significant proportion of children with adenoid hypertrophy (AH) and can lead to developmental sequelae when chronic. Current non-invasive screening modalities rely predominantly on acoustic immittance measurements, which demonstrate vari...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Xiaote Zhang (21570542) (author)
مؤلفون آخرون: Qiaoyi Xie (21570545) (author), Ganggang Wu (6925256) (author)
منشور في: 2025
الموضوعات:
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
الوصف
الملخص:Background<p>Otitis media with effusion (OME) affects a significant proportion of children with adenoid hypertrophy (AH) and can lead to developmental sequelae when chronic. Current non-invasive screening modalities rely predominantly on acoustic immittance measurements, which demonstrate variable diagnostic performance. Given the urgent need for improved diagnostic methods and extensive characterization of risk factors for OME in AH children, developing diagnostic models represents an efficient strategy to enhance clinical identification accuracy in practice.</p>Objective<p>This study aims to develop and validate an optimal machine learning (ML)-based prediction model for OME in AH children by comparing multiple algorithmic approaches, integrating clinical indicators with acoustic measurements into a widely applicable diagnostic tool.</p>Methods<p>A retrospective analysis was conducted on 847 pediatric patients with AH. Five ML algorithms were developed to identify OME using demographic, clinical, laboratory, and acoustic immittance parameters. The dataset underwent 7:3 stratified partitioning for training and testing cohorts. Within the training cohort, models were initially optimized through randomized grid search with 5-fold cross-validation, followed by comprehensive training with optimized parameters. Model performance was evaluated in the testing cohort using discrimination, calibration, clinical utility metrics, and confusion matrix-derived statistics. The optimal ML model was subsequently analyzed through SHapley Additive exPlanations (SHAP) methodology for interpretability, with sequential ablation testing performed to identify critical predictive variables.</p>Results<p>Among 847 children with AH, 262 (30.9%) were diagnosed with OME. The Random Forest (RF) model demonstrated superior performance with the highest discrimination (area under the receiver operating characteristic curve = 0.919), balanced calibration (Brier score = 0.102), and optimal clinical utility across decision thresholds of 0.4–0.6. Confusion matrix analysis further confirmed RF as the optimal model, achieving 0.875 accuracy and robust inter-rater agreement (Cohen's kappa coefficient = 0.696) in the testing cohort. SHAP analysis identified the adenoid-to-nasopharyngeal ratio as the predominant diagnostic indicator, followed by tympanometric type and history of recurrent respiratory infections.</p>Conclusion<p>An RF-based diagnostic model effectively identifies OME in AH children by integrating anatomical, functional, and inflammatory parameters, providing a clinically applicable tool for enhanced diagnostic accuracy and evidence-based management decisions.</p>