Data Sheet 1_Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy.pdf

Background<p>Otitis media with effusion (OME) affects a significant proportion of children with adenoid hypertrophy (AH) and can lead to developmental sequelae when chronic. Current non-invasive screening modalities rely predominantly on acoustic immittance measurements, which demonstrate vari...

Full description

Saved in:
Bibliographic Details
Main Author: Xiaote Zhang (21570542) (author)
Other Authors: Qiaoyi Xie (21570545) (author), Ganggang Wu (6925256) (author)
Published: 2025
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1852019195344060416
author Xiaote Zhang (21570542)
author2 Qiaoyi Xie (21570545)
Ganggang Wu (6925256)
author2_role author
author
author_facet Xiaote Zhang (21570542)
Qiaoyi Xie (21570545)
Ganggang Wu (6925256)
author_role author
dc.creator.none.fl_str_mv Xiaote Zhang (21570542)
Qiaoyi Xie (21570545)
Ganggang Wu (6925256)
dc.date.none.fl_str_mv 2025-06-19T05:28:35Z
dc.identifier.none.fl_str_mv 10.3389/fped.2025.1614495.s001
dc.relation.none.fl_str_mv https://figshare.com/articles/dataset/Data_Sheet_1_Comparative_evaluation_of_machine_learning_models_for_enhancing_diagnostic_accuracy_of_otitis_media_with_effusion_in_children_with_adenoid_hypertrophy_pdf/29363828
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Foetal Development and Medicine
otitis media with effusion
adenoid hypertrophy
machine learning
random forest
diagnostic accuracy
SHAP analysis
dc.title.none.fl_str_mv Data Sheet 1_Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy.pdf
dc.type.none.fl_str_mv Dataset
info:eu-repo/semantics/publishedVersion
dataset
description Background<p>Otitis media with effusion (OME) affects a significant proportion of children with adenoid hypertrophy (AH) and can lead to developmental sequelae when chronic. Current non-invasive screening modalities rely predominantly on acoustic immittance measurements, which demonstrate variable diagnostic performance. Given the urgent need for improved diagnostic methods and extensive characterization of risk factors for OME in AH children, developing diagnostic models represents an efficient strategy to enhance clinical identification accuracy in practice.</p>Objective<p>This study aims to develop and validate an optimal machine learning (ML)-based prediction model for OME in AH children by comparing multiple algorithmic approaches, integrating clinical indicators with acoustic measurements into a widely applicable diagnostic tool.</p>Methods<p>A retrospective analysis was conducted on 847 pediatric patients with AH. Five ML algorithms were developed to identify OME using demographic, clinical, laboratory, and acoustic immittance parameters. The dataset underwent 7:3 stratified partitioning for training and testing cohorts. Within the training cohort, models were initially optimized through randomized grid search with 5-fold cross-validation, followed by comprehensive training with optimized parameters. Model performance was evaluated in the testing cohort using discrimination, calibration, clinical utility metrics, and confusion matrix-derived statistics. The optimal ML model was subsequently analyzed through SHapley Additive exPlanations (SHAP) methodology for interpretability, with sequential ablation testing performed to identify critical predictive variables.</p>Results<p>Among 847 children with AH, 262 (30.9%) were diagnosed with OME. The Random Forest (RF) model demonstrated superior performance with the highest discrimination (area under the receiver operating characteristic curve = 0.919), balanced calibration (Brier score = 0.102), and optimal clinical utility across decision thresholds of 0.4–0.6. Confusion matrix analysis further confirmed RF as the optimal model, achieving 0.875 accuracy and robust inter-rater agreement (Cohen's kappa coefficient = 0.696) in the testing cohort. SHAP analysis identified the adenoid-to-nasopharyngeal ratio as the predominant diagnostic indicator, followed by tympanometric type and history of recurrent respiratory infections.</p>Conclusion<p>An RF-based diagnostic model effectively identifies OME in AH children by integrating anatomical, functional, and inflammatory parameters, providing a clinically applicable tool for enhanced diagnostic accuracy and evidence-based management decisions.</p>
eu_rights_str_mv openAccess
id Manara_e99f436568fd00a628c01326e0831be7
identifier_str_mv 10.3389/fped.2025.1614495.s001
network_acronym_str Manara
network_name_str ManaraRepo
oai_identifier_str oai:figshare.com:article/29363828
publishDate 2025
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spelling Data Sheet 1_Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy.pdfXiaote Zhang (21570542)Qiaoyi Xie (21570545)Ganggang Wu (6925256)Foetal Development and Medicineotitis media with effusionadenoid hypertrophymachine learningrandom forestdiagnostic accuracySHAP analysisBackground<p>Otitis media with effusion (OME) affects a significant proportion of children with adenoid hypertrophy (AH) and can lead to developmental sequelae when chronic. Current non-invasive screening modalities rely predominantly on acoustic immittance measurements, which demonstrate variable diagnostic performance. Given the urgent need for improved diagnostic methods and extensive characterization of risk factors for OME in AH children, developing diagnostic models represents an efficient strategy to enhance clinical identification accuracy in practice.</p>Objective<p>This study aims to develop and validate an optimal machine learning (ML)-based prediction model for OME in AH children by comparing multiple algorithmic approaches, integrating clinical indicators with acoustic measurements into a widely applicable diagnostic tool.</p>Methods<p>A retrospective analysis was conducted on 847 pediatric patients with AH. Five ML algorithms were developed to identify OME using demographic, clinical, laboratory, and acoustic immittance parameters. The dataset underwent 7:3 stratified partitioning for training and testing cohorts. Within the training cohort, models were initially optimized through randomized grid search with 5-fold cross-validation, followed by comprehensive training with optimized parameters. Model performance was evaluated in the testing cohort using discrimination, calibration, clinical utility metrics, and confusion matrix-derived statistics. The optimal ML model was subsequently analyzed through SHapley Additive exPlanations (SHAP) methodology for interpretability, with sequential ablation testing performed to identify critical predictive variables.</p>Results<p>Among 847 children with AH, 262 (30.9%) were diagnosed with OME. The Random Forest (RF) model demonstrated superior performance with the highest discrimination (area under the receiver operating characteristic curve = 0.919), balanced calibration (Brier score = 0.102), and optimal clinical utility across decision thresholds of 0.4–0.6. Confusion matrix analysis further confirmed RF as the optimal model, achieving 0.875 accuracy and robust inter-rater agreement (Cohen's kappa coefficient = 0.696) in the testing cohort. SHAP analysis identified the adenoid-to-nasopharyngeal ratio as the predominant diagnostic indicator, followed by tympanometric type and history of recurrent respiratory infections.</p>Conclusion<p>An RF-based diagnostic model effectively identifies OME in AH children by integrating anatomical, functional, and inflammatory parameters, providing a clinically applicable tool for enhanced diagnostic accuracy and evidence-based management decisions.</p>2025-06-19T05:28:35ZDatasetinfo:eu-repo/semantics/publishedVersiondataset10.3389/fped.2025.1614495.s001https://figshare.com/articles/dataset/Data_Sheet_1_Comparative_evaluation_of_machine_learning_models_for_enhancing_diagnostic_accuracy_of_otitis_media_with_effusion_in_children_with_adenoid_hypertrophy_pdf/29363828CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/293638282025-06-19T05:28:35Z
spellingShingle Data Sheet 1_Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy.pdf
Xiaote Zhang (21570542)
Foetal Development and Medicine
otitis media with effusion
adenoid hypertrophy
machine learning
random forest
diagnostic accuracy
SHAP analysis
status_str publishedVersion
title Data Sheet 1_Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy.pdf
title_full Data Sheet 1_Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy.pdf
title_fullStr Data Sheet 1_Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy.pdf
title_full_unstemmed Data Sheet 1_Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy.pdf
title_short Data Sheet 1_Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy.pdf
title_sort Data Sheet 1_Comparative evaluation of machine learning models for enhancing diagnostic accuracy of otitis media with effusion in children with adenoid hypertrophy.pdf
topic Foetal Development and Medicine
otitis media with effusion
adenoid hypertrophy
machine learning
random forest
diagnostic accuracy
SHAP analysis