Deep Models for Stroke Segmentation: Do Complex Architectures Always Perform Better?

<p dir="ltr">The accurate segmentation of stroke lesions is crucial for the diagnosis and treatment of stroke patients, as it provides spatial information about affected brain regions and the extent of damage. While conventional manual techniques are time-consuming and prone to error...

Full description

Saved in:
Bibliographic Details
Main Author: Ahmed Soliman (4591621) (author)
Other Authors: Yalda Zafari-Ghadim (22282849) (author), Yousif Yousif (22282852) (author), Ahmed Ibrahim (1771174) (author), Amr Mohamed (3508121) (author), Essam A. Rashed (11949249) (author), Mohamed A. Mabrok (22282855) (author)
Published: 2024
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1864513513500704768
author Ahmed Soliman (4591621)
author2 Yalda Zafari-Ghadim (22282849)
Yousif Yousif (22282852)
Ahmed Ibrahim (1771174)
Amr Mohamed (3508121)
Essam A. Rashed (11949249)
Mohamed A. Mabrok (22282855)
author2_role author
author
author
author
author
author
author_facet Ahmed Soliman (4591621)
Yalda Zafari-Ghadim (22282849)
Yousif Yousif (22282852)
Ahmed Ibrahim (1771174)
Amr Mohamed (3508121)
Essam A. Rashed (11949249)
Mohamed A. Mabrok (22282855)
author_role author
dc.creator.none.fl_str_mv Ahmed Soliman (4591621)
Yalda Zafari-Ghadim (22282849)
Yousif Yousif (22282852)
Ahmed Ibrahim (1771174)
Amr Mohamed (3508121)
Essam A. Rashed (11949249)
Mohamed A. Mabrok (22282855)
dc.date.none.fl_str_mv 2024-12-31T12:00:00Z
dc.identifier.none.fl_str_mv 10.1109/access.2024.3522214
dc.relation.none.fl_str_mv https://figshare.com/articles/journal_contribution/Deep_Models_for_Stroke_Segmentation_Do_Complex_Architectures_Always_Perform_Better_/30173515
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Biomedical and clinical sciences
Neurosciences
Engineering
Biomedical engineering
Health sciences
Health services and systems
Information and computing sciences
Artificial intelligence
Convolutional neural networks
deep learning
nnU-Net
stroke segmentation
vision Transformer
Image segmentation
Transformers
Computer architecture
Stroke (medical condition)
Feature extraction
Data models
Computer vision
Decoding
Accuracy
dc.title.none.fl_str_mv Deep Models for Stroke Segmentation: Do Complex Architectures Always Perform Better?
dc.type.none.fl_str_mv Text
Journal contribution
info:eu-repo/semantics/publishedVersion
text
contribution to journal
description <p dir="ltr">The accurate segmentation of stroke lesions is crucial for the diagnosis and treatment of stroke patients, as it provides spatial information about affected brain regions and the extent of damage. While conventional manual techniques are time-consuming and prone to errors, advanced deep learning models have shown promising results in medical image segmentation. Recently, several complex architectures, such as vision Transformers and attention-based convolutional neural networks (CNNs), have been introduced for this task. However, the question remains whether such high-level designs are necessary to achieve the best results for all segmentation cases. In this paper, we evaluated the performance of four types of deep models for stroke segmentation: 1) a pure Transformer-based architecture (DAE-Former), 2) two advanced CNN-based models (LKA and DLKA) with attention mechanisms, 3) a hybrid model that incorporates CNNs with Transformers (FCT), and 4) the well-known self-adaptive nnU-Net framework. We examined their performance on two publicly available datasets, ISLES 2022 and ATLAS v2.0, and found that the nnU-Net, with its relatively simple design, achieved the best results among all the models tested. Furthermore, we investigated the impact of an imbalanced distribution of the number of unconnected components in each slice, as a representation of common variability in stroke segmentation. Our findings reveal a potential robustness issue of Transformers to such variability, which may explain their unexpected weak performance. Additionally, the success of nnU-Net underscores the significant impact of pre- and post-processing techniques in enhancing segmentation results, rather than solely focusing on architectural designs. These findings suggest that proposed complex architectures may be task-specific and simpler models with appropriate pre-/post-processing pipeline can be equally or more effective in generalization across different tasks in medical image segmentation.</p><h2>Other Information</h2><p dir="ltr">Published in: IEEE Access<br>License: <a href="https://creativecommons.org/licenses/by/4.0/deed.en" target="_blank">https://creativecommons.org/licenses/by/4.0/</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1109/access.2024.3522214" target="_blank">https://dx.doi.org/10.1109/access.2024.3522214</a></p>
eu_rights_str_mv openAccess
id Manara2_625b0c75d399fb14f838ea93cd47e77e
identifier_str_mv 10.1109/access.2024.3522214
network_acronym_str Manara2
network_name_str Manara2
oai_identifier_str oai:figshare.com:article/30173515
publishDate 2024
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spelling Deep Models for Stroke Segmentation: Do Complex Architectures Always Perform Better?Ahmed Soliman (4591621)Yalda Zafari-Ghadim (22282849)Yousif Yousif (22282852)Ahmed Ibrahim (1771174)Amr Mohamed (3508121)Essam A. Rashed (11949249)Mohamed A. Mabrok (22282855)Biomedical and clinical sciencesNeurosciencesEngineeringBiomedical engineeringHealth sciencesHealth services and systemsInformation and computing sciencesArtificial intelligenceConvolutional neural networksdeep learningnnU-Netstroke segmentationvision TransformerImage segmentationTransformersComputer architectureStroke (medical condition)Feature extractionData modelsComputer visionDecodingAccuracy<p dir="ltr">The accurate segmentation of stroke lesions is crucial for the diagnosis and treatment of stroke patients, as it provides spatial information about affected brain regions and the extent of damage. While conventional manual techniques are time-consuming and prone to errors, advanced deep learning models have shown promising results in medical image segmentation. Recently, several complex architectures, such as vision Transformers and attention-based convolutional neural networks (CNNs), have been introduced for this task. However, the question remains whether such high-level designs are necessary to achieve the best results for all segmentation cases. In this paper, we evaluated the performance of four types of deep models for stroke segmentation: 1) a pure Transformer-based architecture (DAE-Former), 2) two advanced CNN-based models (LKA and DLKA) with attention mechanisms, 3) a hybrid model that incorporates CNNs with Transformers (FCT), and 4) the well-known self-adaptive nnU-Net framework. We examined their performance on two publicly available datasets, ISLES 2022 and ATLAS v2.0, and found that the nnU-Net, with its relatively simple design, achieved the best results among all the models tested. Furthermore, we investigated the impact of an imbalanced distribution of the number of unconnected components in each slice, as a representation of common variability in stroke segmentation. Our findings reveal a potential robustness issue of Transformers to such variability, which may explain their unexpected weak performance. Additionally, the success of nnU-Net underscores the significant impact of pre- and post-processing techniques in enhancing segmentation results, rather than solely focusing on architectural designs. These findings suggest that proposed complex architectures may be task-specific and simpler models with appropriate pre-/post-processing pipeline can be equally or more effective in generalization across different tasks in medical image segmentation.</p><h2>Other Information</h2><p dir="ltr">Published in: IEEE Access<br>License: <a href="https://creativecommons.org/licenses/by/4.0/deed.en" target="_blank">https://creativecommons.org/licenses/by/4.0/</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1109/access.2024.3522214" target="_blank">https://dx.doi.org/10.1109/access.2024.3522214</a></p>2024-12-31T12:00:00ZTextJournal contributioninfo:eu-repo/semantics/publishedVersiontextcontribution to journal10.1109/access.2024.3522214https://figshare.com/articles/journal_contribution/Deep_Models_for_Stroke_Segmentation_Do_Complex_Architectures_Always_Perform_Better_/30173515CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/301735152024-12-31T12:00:00Z
spellingShingle Deep Models for Stroke Segmentation: Do Complex Architectures Always Perform Better?
Ahmed Soliman (4591621)
Biomedical and clinical sciences
Neurosciences
Engineering
Biomedical engineering
Health sciences
Health services and systems
Information and computing sciences
Artificial intelligence
Convolutional neural networks
deep learning
nnU-Net
stroke segmentation
vision Transformer
Image segmentation
Transformers
Computer architecture
Stroke (medical condition)
Feature extraction
Data models
Computer vision
Decoding
Accuracy
status_str publishedVersion
title Deep Models for Stroke Segmentation: Do Complex Architectures Always Perform Better?
title_full Deep Models for Stroke Segmentation: Do Complex Architectures Always Perform Better?
title_fullStr Deep Models for Stroke Segmentation: Do Complex Architectures Always Perform Better?
title_full_unstemmed Deep Models for Stroke Segmentation: Do Complex Architectures Always Perform Better?
title_short Deep Models for Stroke Segmentation: Do Complex Architectures Always Perform Better?
title_sort Deep Models for Stroke Segmentation: Do Complex Architectures Always Perform Better?
topic Biomedical and clinical sciences
Neurosciences
Engineering
Biomedical engineering
Health sciences
Health services and systems
Information and computing sciences
Artificial intelligence
Convolutional neural networks
deep learning
nnU-Net
stroke segmentation
vision Transformer
Image segmentation
Transformers
Computer architecture
Stroke (medical condition)
Feature extraction
Data models
Computer vision
Decoding
Accuracy