ON OPTICAL CHARACTER RECOGNITION OF ARABIC TEXT

Although optical character recognition has made tremendous achievements in the area of desktop publishing, a huge amount of work remains to be done. Unlike Roman-like languages, various languages possess a large number of fonts and/or have complicated shapes. Arabic language...

Bibliographic Details
Main Author: Zidouri, Abdelmalek (author)
Other Authors: Sarfraz, Muhammad (author), unknown (author)
Format: article
Published: 2020
Online Access: https://eprints.kfupm.edu.sa/id/eprint/1595/1/P109.pdf
_version_ 1864513400586895360
author Zidouri, Abdelmalek
author2 Sarfraz, Muhammad
unknown
author2_role author
author
author_facet Zidouri, Abdelmalek
Sarfraz, Muhammad
unknown
author_role author
dc.creator.none.fl_str_mv Zidouri, Abdelmalek
Sarfraz, Muhammad
unknown
dc.date.*.fl_str_mv 2020
dc.format.none.fl_str_mv application/pdf
dc.identifier.none.fl_str_mv https://eprints.kfupm.edu.sa/id/eprint/1595/1/P109.pdf
ON OPTICAL CHARACTER RECOGNITION OF ARABIC TEXT. The 6th Saudi Engineering Conference, KFUPM, Dhahran, December 2002.
dc.language.none.fl_str_mv en
dc.relation.none.fl_str_mv https://eprints.kfupm.edu.sa/id/eprint/1595/
dc.rights.*.fl_str_mv info:eu-repo/semantics/openAccess
dc.title.none.fl_str_mv ON OPTICAL CHARACTER RECOGNITION OF ARABIC TEXT
dc.type.none.fl_str_mv Article
PeerReviewed
info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/article
description Although optical character recognition has made tremendous achievements in the area of desktop publishing, a huge amount of work remains to be done. Unlike Roman-like languages, various languages possess a large number of fonts and/or have complicated shapes. Arabic is one of those languages and is somewhat complicated in its construction. Although a reasonable amount of work has been reported so far for Arabic, a good deal of work is still needed. In addition, many other languages also need considerable attention with regard to their automatic recognition. Efficient, robust, and error-free methodologies are required to develop systems for such languages so that recent display and printing hardware technologies can be utilized. This work is devoted to one way of addressing the problem of recognizing the Arabic alphabet. We give a brief survey of the state of the art in Arabic character recognition and of the different methods and approaches to this problem. We show that recognition can be achieved by simple matching to prebuilt prototypes of the entire Arabic character set. This segmentation-free approach proved to be efficient for the recognition of one font of the Arabic language. We treat Arabic as a well-structured language and base our prototype description on a method called “Minimum Covering Run Expression”. We also show that our database of prototypes is easily extendable to allow for multifont recognition of Arabic as a basis for a full Arabic OCR system.
eu_rights_str_mv openAccess
format article
id KFUPM_86446736bbba8997d1a92285a34bd9b1
identifier_str_mv ON OPTICAL CHARACTER RECOGNITION OF ARABIC TEXT. The 6th Saudi Engineering Conference, KFUPM, Dhahran, December 2002.
language_invalid_str_mv en
network_acronym_str KFUPM
network_name_str King Fahd University of Petroleum and Minerals
oai_identifier_str oai::1595
publishDate 2020
status_str publishedVersion
title ON OPTICAL CHARACTER RECOGNITION OF ARABIC TEXT
url https://eprints.kfupm.edu.sa/id/eprint/1595/1/P109.pdf