Motion Images with Positioning Information and Deep Learning for Continuous Arabic Sign Language Recognition in Signer Dependent and Independent Modes

While recognition of sign language alphabets and isolated words has matured in recent years, recognition of sign language sentences, or continuous signing, is still a research topic of interest in computer vision, especially in signer independent mode of recognition. Existing state-of-the-art soluti...

Full description

Saved in:

Bibliographic Details
Main Author:	Almaazmi, Mariam (author)
Other Authors:	Elkadi, Salma (author), Elsayed, Lamis (author), Salman, Lina (author), Shanableh, Tamer (author)
Format:	article
Published:	2024
Subjects:	Computer vision Continuous sign language recognition Deep learning Recurrent neural networks Video analysis
Online Access:	https://hdl.handle.net/11073/25705
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1864513442435563520
author	Almaazmi, Mariam
author2	Elkadi, Salma Elsayed, Lamis Salman, Lina Shanableh, Tamer
author2_role	author author author author
author_facet	Almaazmi, Mariam Elkadi, Salma Elsayed, Lamis Salman, Lina Shanableh, Tamer
author_role	author
dc.creator.none.fl_str_mv	Almaazmi, Mariam Elkadi, Salma Elsayed, Lamis Salman, Lina Shanableh, Tamer
dc.date.none.fl_str_mv	2024-10-28T09:47:13Z 2024-10-28T09:47:13Z 2024
dc.identifier.none.fl_str_mv	https://hdl.handle.net/11073/25705 10.1109/access.2024.3485131 2169-3536
dc.language.none.fl_str_mv	en_US
dc.publisher.none.fl_str_mv	Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.none.fl_str_mv	https://doi.org/10.1109/ACCESS.2024.3485131
dc.rights.none.fl_str_mv	https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.source.none.fl_str_mv	IEEE Access 1 1
dc.subject.none.fl_str_mv	Computer vision Continuous sign language recognition Deep learning Recurrent neural networks Video analysis
dc.title.none.fl_str_mv	Motion Images with Positioning Information and Deep Learning for Continuous Arabic Sign Language Recognition in Signer Dependent and Independent Modes
dc.type.none.fl_str_mv	Postprint Peer-Reviewed info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article
description	While recognition of sign language alphabets and isolated words has matured in recent years, recognition of sign language sentences, or continuous signing, is still a research topic of interest in computer vision, especially in signer independent mode of recognition. Existing state-of-the-art solutions in the continuous Arabic Sign Language Recognition (ArSLR) are promising; however, when implemented in signer independent mode, the accuracies drop noticeably. In this paper, we propose a solution for recognizing continuous Arabic signing in signer dependent and independent modes through the use of motion images with positioning information. Initially, sign videos are converted into several motion images, each emphasizing a different segment of the sentence. This is achieved by a weighted sum of residual images after applying optical flow and motion compensation. Each motion image is composed of the whole sentence video, hence putting the emphasized segment into context in terms of previous and successive sign words. Thereafter, hand-crafted features are calculated from each resultant image, including numerical summaries of the horizontal, vertical and diagonal profiles. With such features, the architecture used for model generation and testing is simplified, where it consists of a single Bi-LSTM layer followed by dropout, softmax, and classification layers. This paper makes use of a recent Arabic Sign Language dataset known as ArabSign. The dataset is composed of 6 signers and 93 words arranged into 50 sentences with 30 repetitions each. Experimental results revealed that the proposed solution is suitable for signer dependent and signer independent modes of continuous sign language recognition. Using a Leave-One-Signer-Out policy, the proposed solution achieved word-recognition rates of 99.8% and 75.3%, respectively. These results noticeably surpass relevant state-of-the-art solutions.
format	article
id	aus_605527dffbc3787e21fca7784d73904e
identifier_str_mv	10.1109/access.2024.3485131 2169-3536
language_invalid_str_mv	en_US
network_acronym_str	aus
network_name_str	aus
oai_identifier_str	oai:repository.aus.edu:11073/25705
publishDate	2024
publisher.none.fl_str_mv	Institute of Electrical and Electronics Engineers (IEEE)
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv	https://creativecommons.org/licenses/by-nc-nd/4.0/
spelling	Motion Images with Positioning Information and Deep Learning for Continuous Arabic Sign Language Recognition in Signer Dependent and Independent ModesAlmaazmi, MariamElkadi, SalmaElsayed, LamisSalman, LinaShanableh, TamerComputer visionContinuous sign language recognitionDeep learningRecurrent neural networksVideo analysisWhile recognition of sign language alphabets and isolated words has matured in recent years, recognition of sign language sentences, or continuous signing, is still a research topic of interest in computer vision, especially in signer independent mode of recognition. Existing state-of-the-art solutions in the continuous Arabic Sign Language Recognition (ArSLR) are promising; however, when implemented in signer independent mode, the accuracies drop noticeably. In this paper, we propose a solution for recognizing continuous Arabic signing in signer dependent and independent modes through the use of motion images with positioning information. Initially, sign videos are converted into several motion images, each emphasizing a different segment of the sentence. This is achieved by a weighted sum of residual images after applying optical flow and motion compensation. Each motion image is composed of the whole sentence video, hence putting the emphasized segment into context in terms of previous and successive sign words. Thereafter, hand-crafted features are calculated from each resultant image, including numerical summaries of the horizontal, vertical and diagonal profiles. With such features, the architecture used for model generation and testing is simplified, where it consists of a single Bi-LSTM layer followed by dropout, softmax, and classification layers. This paper makes use of a recent Arabic Sign Language dataset known as ArabSign. The dataset is composed of 6 signers and 93 words arranged into 50 sentences with 30 repetitions each. Experimental results revealed that the proposed solution is suitable for signer dependent and signer independent modes of continuous sign language recognition. Using a Leave-One-Signer-Out policy, the proposed solution achieved word-recognition rates of 99.8% and 75.3%, respectively. These results noticeably surpass relevant state-of-the-art solutions.American University of SharjahInstitute of Electrical and Electronics Engineers (IEEE)2024-10-28T09:47:13Z2024-10-28T09:47:13Z2024PostprintPeer-Reviewedinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttps://hdl.handle.net/11073/2570510.1109/access.2024.34851312169-3536IEEE Access11en_UShttps://doi.org/10.1109/ACCESS.2024.3485131https://creativecommons.org/licenses/by-nc-nd/4.0/oai:repository.aus.edu:11073/257052024-10-28T11:44:35Z
spellingShingle	Motion Images with Positioning Information and Deep Learning for Continuous Arabic Sign Language Recognition in Signer Dependent and Independent Modes Almaazmi, Mariam Computer vision Continuous sign language recognition Deep learning Recurrent neural networks Video analysis
status_str	publishedVersion
title	Motion Images with Positioning Information and Deep Learning for Continuous Arabic Sign Language Recognition in Signer Dependent and Independent Modes
title_full	Motion Images with Positioning Information and Deep Learning for Continuous Arabic Sign Language Recognition in Signer Dependent and Independent Modes
title_fullStr	Motion Images with Positioning Information and Deep Learning for Continuous Arabic Sign Language Recognition in Signer Dependent and Independent Modes
title_full_unstemmed	Motion Images with Positioning Information and Deep Learning for Continuous Arabic Sign Language Recognition in Signer Dependent and Independent Modes
title_short	Motion Images with Positioning Information and Deep Learning for Continuous Arabic Sign Language Recognition in Signer Dependent and Independent Modes
title_sort	Motion Images with Positioning Information and Deep Learning for Continuous Arabic Sign Language Recognition in Signer Dependent and Independent Modes
topic	Computer vision Continuous sign language recognition Deep learning Recurrent neural networks Video analysis
url	https://hdl.handle.net/11073/25705

Motion Images with Positioning Information and Deep Learning for Continuous Arabic Sign Language Recognition in Signer Dependent and Independent Modes

Similar Items