ORAN: a basis for an Arabic OCR system

We present a system called ORAN (offline recognition of Arabic characters and numerals). This system is based on a method called modified MCR (minimum covering run) expression for document images. Using the correspondence between binary images and bipartite graphs, the MCR expression can be found by...

Full description

Saved in:
Bibliographic Details
Main Author: Zidouri, A. (author)
Other Authors: unknown (author)
Format: article
Published: 2004
Subjects:
Online Access:https://eprints.kfupm.edu.sa/id/eprint/14150/1/14150_1.pdf
https://eprints.kfupm.edu.sa/id/eprint/14150/2/14150_2.doc
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1864513393385275392
author Zidouri, A.
author2 unknown
author2_role author
author_facet Zidouri, A.
unknown
author_role author
dc.creator.none.fl_str_mv Zidouri, A.
unknown
dc.date.none.fl_str_mv 2004-10
2020
dc.format.none.fl_str_mv application/pdf
application/msword
dc.identifier.none.fl_str_mv https://eprints.kfupm.edu.sa/id/eprint/14150/1/14150_1.pdf
https://eprints.kfupm.edu.sa/id/eprint/14150/2/14150_2.doc
(2004) ORAN: a basis for an Arabic OCR system. Intelligent Multimedia, Video and Speech Processing, 2004. Proceedings of 2004 International Symposium on, 1.
dc.language.none.fl_str_mv en
en
dc.publisher.none.fl_str_mv IEEE
dc.relation.none.fl_str_mv https://eprints.kfupm.edu.sa/id/eprint/14150/
dc.rights.*.fl_str_mv info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Computer
dc.title.none.fl_str_mv ORAN: a basis for an Arabic OCR system
dc.type.none.fl_str_mv Article
PeerReviewed
info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/article
description We present a system called ORAN (offline recognition of Arabic characters and numerals). This system is based on a method called modified MCR (minimum covering run) expression for document images. Using the correspondence between binary images and bipartite graphs, the MCR expression can be found by constructing a minimum covering or maximum matching in the corresponding graph. We use the structural information obtained from this expression to describe the character strokes according to some extracted features. These are obtained after a zoning scheme, where the baseline is detected and the line of text divided into four zones. Reference prototypes for the system are built according to a structural description of characters in some model documents. By this method, we overcome the problem of segmentation that is inherent to Arabic characters, even when they are machine printed or typed. Simple matching of the candidate characters to reference prototypes is performed. A recognition rate of more than 97% is achieved.
eu_rights_str_mv openAccess
format article
id KFUPM_22f44d8d566116c8fb0668c23d692b52
identifier_str_mv (2004) ORAN: a basis for an Arabic OCR system. Intelligent Multimedia, Video and Speech Processing, 2004. Proceedings of 2004 International Symposium on, 1.
language_invalid_str_mv en
network_acronym_str KFUPM
network_name_str King Fahd University of Petroleum and Minerals
oai_identifier_str oai::14150
publishDate 2004
publisher.none.fl_str_mv IEEE
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling ORAN: a basis for an Arabic OCR systemZidouri, A.unknownComputerWe present a system called ORAN (offline recognition of Arabic characters and numerals). This system is based on a method called modified MCR (minimum covering run) expression for document images. Using the correspondence between binary images and bipartite graphs, the MCR expression can be found by constructing a minimum covering or maximum matching in the corresponding graph. We use the structural information obtained from this expression to describe the character strokes according to some extracted features. These are obtained after a zoning scheme, where the baseline is detected and the line of text divided into four zones. Reference prototypes for the system are built according to a structural description of characters in some model documents. By this method, we overcome the problem of segmentation that is inherent to Arabic characters, even when they are machine printed or typed. Simple matching of the candidate characters to reference prototypes is performed. A recognition rate of more than 97% is achieved.IEEE2004-102020ArticlePeerReviewedinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfapplication/mswordhttps://eprints.kfupm.edu.sa/id/eprint/14150/1/14150_1.pdfhttps://eprints.kfupm.edu.sa/id/eprint/14150/2/14150_2.doc (2004) ORAN: a basis for an Arabic OCR system. Intelligent Multimedia, Video and Speech Processing, 2004. Proceedings of 2004 International Symposium on, 1. enenhttps://eprints.kfupm.edu.sa/id/eprint/14150/info:eu-repo/semantics/openAccessoai::141502019-11-01T14:04:27Z
spellingShingle ORAN: a basis for an Arabic OCR system
Zidouri, A.
Computer
status_str publishedVersion
title ORAN: a basis for an Arabic OCR system
title_full ORAN: a basis for an Arabic OCR system
title_fullStr ORAN: a basis for an Arabic OCR system
title_full_unstemmed ORAN: a basis for an Arabic OCR system
title_short ORAN: a basis for an Arabic OCR system
title_sort ORAN: a basis for an Arabic OCR system
topic Computer
url https://eprints.kfupm.edu.sa/id/eprint/14150/1/14150_1.pdf
https://eprints.kfupm.edu.sa/id/eprint/14150/2/14150_2.doc