The method implemented by the winning team.

<p>Schematic overview of the data processing, feature selection, and prediction modeling workflow. (a) The workflow begins with raw experimental data, including training and challenge datasets from plasma antibody levels, PBMC gene expression, PBMC cell frequency, and plasma cytokine concentra...

Full description

Saved in:
Bibliographic Details
Main Author: Pramod Shinde (20968597) (author)
Other Authors: Lisa Willemsen (12067970) (author), Michael Anderson (3396614) (author), Minori Aoki (20968600) (author), Saonli Basu (28868) (author), Julie G. Burel (14378658) (author), Peng Cheng (153428) (author), Souradipto Ghosh Dastidar (20968603) (author), Aidan Dunleavy (20968606) (author), Tal Einav (2629669) (author), Jamie Forschmiedt (20968609) (author), Slim Fourati (221387) (author), Javier Garcia (709874) (author), William Gibson (3017370) (author), Jason A. Greenbaum (10287146) (author), Leying Guan (4720269) (author), Weikang Guan (20968612) (author), Jeremy P. Gygi (17346673) (author), Brendan Ha (20968615) (author), Joe Hou (20968618) (author), Jason Hsiao (20968621) (author), Yunda Huang (235282) (author), Rick Jansen (198642) (author), Bhargob Kakoty (20968624) (author), Zhiyu Kang (2821910) (author), James J. Kobie (7365761) (author), Mari Kojima (832141) (author), Anna Konstorum (20968627) (author), Jiyeun Lee (20968630) (author), Sloan A. Lewis (11304747) (author), Aixin Li (2332534) (author), Eric F. Lock (4573135) (author), Jarjapu Mahita (807956) (author), Marcus Mendes (20968633) (author), Hailong Meng (19806) (author), Aidan Neher (20968636) (author), Somayeh Nili (20968639) (author), Lars Rønn Olsen (1633549) (author), Shelby Orfield (20968642) (author), James A. Overton (2634355) (author), Nidhi Pai (20968645) (author), Cokie Parker (20968648) (author), Brian Qian (20968651) (author), Mikkel Rasmussen (20968654) (author), Joaquin Reyna (20968657) (author), Eve Richardson (16561164) (author), Sandra Safo (4241968) (author), Josey Sorenson (20968660) (author), Aparna Srinivasan (4479640) (author), Nicola Thrupp (20968663) (author), Rashmi Tippalagama (14378646) (author), Raphael Trevizani (3639214) (author), Steffen Ventz (4594945) (author), Jiuzhou Wang (7818911) (author), Cheng-Chang Wu (20968666) (author), Ferhat Ay (368912) (author), Barry Grant (21679) (author), Steven H. Kleinstein (8219844) (author), Bjoern Peters (19131) (author)
Published: 2025
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:<p>Schematic overview of the data processing, feature selection, and prediction modeling workflow. (a) The workflow begins with raw experimental data, including training and challenge datasets from plasma antibody levels, PBMC gene expression, PBMC cell frequency, and plasma cytokine concentration assays. The common features across these datasets are identified, followed by batch-effect correction and timepoint-wise imputation. (b) Feature selection was performed using various dimension reduction techniques, including LASSO, Ridge, PLS, PCA, and Multiple Co-inertia Analysis (MCIA). MCIA outperformed the other models and was selected for further analysis. MCIA integrates different data types (e.g., X1, X2, X3, X4) and their associated weights (A1, A2, A3, A4) to produce MCIA factors (G) that represent the combined data structure. (c) These MCIA factors were then used in a Linear Mixed Effects (LME) model to predict the outcome. The model was trained on 80% of the data (train set) using 5-fold cross-validation and evaluated on the remaining 20% (test set). The trained model was then applied to the challenge baseline data to generate predictions, which were used to rank subjects according to their predicted outcomes. Figure is created in <a href="https://BioRender.com" target="_blank">https://BioRender.com</a>.</p>