ORAN: a basis for an Arabic OCR system
We present a system called ORAN (offline recognition of Arabic characters and numerals). This system is based on a method called modified MCR (minimum covering run) expression for document images. Using the correspondence between binary images and bipartite graphs, the MCR expression can be found by...
| Main Author: | Zidouri, A. |
|---|---|
| Other Authors: | |
| Format: | article |
| Published: | 2004 |
| Subjects: | |
| Online Access: | https://eprints.kfupm.edu.sa/id/eprint/14150/1/14150_1.pdf https://eprints.kfupm.edu.sa/id/eprint/14150/2/14150_2.doc |
| _version_ | 1864513393385275392 |
|---|---|
| author | Zidouri, A. |
| author2 | unknown |
| author2_role | author |
| author_facet | Zidouri, A. unknown |
| author_role | author |
| dc.creator.none.fl_str_mv | Zidouri, A. unknown |
| dc.date.none.fl_str_mv | 2004-10 2020 |
| dc.format.none.fl_str_mv | application/pdf application/msword |
| dc.identifier.none.fl_str_mv | https://eprints.kfupm.edu.sa/id/eprint/14150/1/14150_1.pdf https://eprints.kfupm.edu.sa/id/eprint/14150/2/14150_2.doc (2004) ORAN: a basis for an Arabic OCR system. Intelligent Multimedia, Video and Speech Processing, 2004. Proceedings of 2004 International Symposium on, 1. |
| dc.language.none.fl_str_mv | en en |
| dc.publisher.none.fl_str_mv | IEEE |
| dc.relation.none.fl_str_mv | https://eprints.kfupm.edu.sa/id/eprint/14150/ |
| dc.rights.*.fl_str_mv | info:eu-repo/semantics/openAccess |
| dc.subject.none.fl_str_mv | Computer |
| dc.title.none.fl_str_mv | ORAN: a basis for an Arabic OCR system |
| dc.type.none.fl_str_mv | Article PeerReviewed info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article |
| description | We present a system called ORAN (offline recognition of Arabic characters and numerals). This system is based on a method called modified MCR (minimum covering run) expression for document images. Using the correspondence between binary images and bipartite graphs, the MCR expression can be found by constructing a minimum covering or maximum matching in the corresponding graph. We use the structural information obtained from this expression to describe the character strokes according to some extracted features. These are obtained after a zoning scheme, where the baseline is detected and the line of text divided into four zones. Reference prototypes for the system are built according to a structural description of characters in some model documents. By this method, we overcome the problem of segmentation that is inherent to Arabic characters, even when they are machine printed or typed. Simple matching of the candidate characters to reference prototypes is performed. A recognition rate of more than 97% is achieved. |
| eu_rights_str_mv | openAccess |
| format | article |
| id | KFUPM_22f44d8d566116c8fb0668c23d692b52 |
| identifier_str_mv | (2004) ORAN: a basis for an Arabic OCR system. Intelligent Multimedia, Video and Speech Processing, 2004. Proceedings of 2004 International Symposium on, 1. |
| language_invalid_str_mv | en |
| network_acronym_str | KFUPM |
| network_name_str | King Fahd University of Petroleum and Minerals |
| oai_identifier_str | oai::14150 |
| publishDate | 2004 |
| publisher.none.fl_str_mv | IEEE |
| repository.mail.fl_str_mv | |
| repository.name.fl_str_mv | |
| repository_id_str | |
| spelling | ORAN: a basis for an Arabic OCR system Zidouri, A. unknown Computer We present a system called ORAN (offline recognition of Arabic characters and numerals). This system is based on a method called modified MCR (minimum covering run) expression for document images. Using the correspondence between binary images and bipartite graphs, the MCR expression can be found by constructing a minimum covering or maximum matching in the corresponding graph. We use the structural information obtained from this expression to describe the character strokes according to some extracted features. These are obtained after a zoning scheme, where the baseline is detected and the line of text divided into four zones. Reference prototypes for the system are built according to a structural description of characters in some model documents. By this method, we overcome the problem of segmentation that is inherent to Arabic characters, even when they are machine printed or typed. Simple matching of the candidate characters to reference prototypes is performed. A recognition rate of more than 97% is achieved. IEEE 2004-10 2020 Article PeerReviewed info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article application/pdf application/msword https://eprints.kfupm.edu.sa/id/eprint/14150/1/14150_1.pdf https://eprints.kfupm.edu.sa/id/eprint/14150/2/14150_2.doc (2004) ORAN: a basis for an Arabic OCR system. Intelligent Multimedia, Video and Speech Processing, 2004. Proceedings of 2004 International Symposium on, 1. en en https://eprints.kfupm.edu.sa/id/eprint/14150/ info:eu-repo/semantics/openAccess oai::14150 2019-11-01T14:04:27Z |
| spellingShingle | ORAN: a basis for an Arabic OCR system Zidouri, A. Computer |
| status_str | publishedVersion |
| title | ORAN: a basis for an Arabic OCR system |
| title_full | ORAN: a basis for an Arabic OCR system |
| title_fullStr | ORAN: a basis for an Arabic OCR system |
| title_full_unstemmed | ORAN: a basis for an Arabic OCR system |
| title_short | ORAN: a basis for an Arabic OCR system |
| title_sort | ORAN: a basis for an Arabic OCR system |
| topic | Computer |
| url | https://eprints.kfupm.edu.sa/id/eprint/14150/1/14150_1.pdf https://eprints.kfupm.edu.sa/id/eprint/14150/2/14150_2.doc |
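
The description field above says the MCR expression can be found from a minimum covering or maximum matching in a bipartite graph derived from the binary image. The sketch below is one plausible reading of that correspondence, not the paper's implementation: it assumes each horizontal run of black pixels is a left vertex, each vertical run is a right vertex, and each black pixel is an edge joining the two runs that contain it. A minimum vertex cover of that graph is then a smallest set of runs covering every black pixel. The function names (`label_runs`, `minimum_covering_runs`) are hypothetical, the matching is a plain augmenting-path search, and the "modified" MCR variant mentioned in the record is not reproduced.

```python
def label_runs(image):
    """Give every black pixel the id of its horizontal and vertical run."""
    rows, cols = len(image), len(image[0])
    h_id = [[-1] * cols for _ in range(rows)]
    v_id = [[-1] * cols for _ in range(rows)]
    n_h = 0
    for r in range(rows):
        for c in range(cols):
            if image[r][c]:
                if c > 0 and image[r][c - 1]:
                    h_id[r][c] = h_id[r][c - 1]  # continue the current run
                else:
                    h_id[r][c] = n_h  # start a new horizontal run
                    n_h += 1
    n_v = 0
    for c in range(cols):
        for r in range(rows):
            if image[r][c]:
                if r > 0 and image[r - 1][c]:
                    v_id[r][c] = v_id[r - 1][c]
                else:
                    v_id[r][c] = n_v  # start a new vertical run
                    n_v += 1
    return h_id, v_id, n_h, n_v


def minimum_covering_runs(image):
    """Return (horizontal_run_ids, vertical_run_ids) of a minimum run cover."""
    h_id, v_id, n_h, n_v = label_runs(image)

    # Assumed graph: one edge per black pixel, from its horizontal run
    # to its vertical run.
    adj = [set() for _ in range(n_h)]
    for r, row in enumerate(image):
        for c, pixel in enumerate(row):
            if pixel:
                adj[h_id[r][c]].add(v_id[r][c])

    match_h = [-1] * n_h  # vertical run matched to each horizontal run
    match_v = [-1] * n_v  # horizontal run matched to each vertical run

    def augment(h, seen):
        # Depth-first search for an augmenting path starting at run h.
        for v in adj[h]:
            if v not in seen:
                seen.add(v)
                if match_v[v] == -1 or augment(match_v[v], seen):
                    match_h[h] = v
                    match_v[v] = h
                    return True
        return False

    for h in range(n_h):
        augment(h, set())

    # Koenig's theorem: walk alternating paths from unmatched horizontal
    # runs; the cover is the unvisited horizontal runs plus the visited
    # vertical runs, and its size equals the size of the maximum matching.
    visited_h = {h for h in range(n_h) if match_h[h] == -1}
    visited_v = set()
    stack = list(visited_h)
    while stack:
        h = stack.pop()
        for v in adj[h]:
            if v not in visited_v:
                visited_v.add(v)
                nxt = match_v[v]
                if nxt != -1 and nxt not in visited_h:
                    visited_h.add(nxt)
                    stack.append(nxt)

    cover_h = [h for h in range(n_h) if h not in visited_h]
    cover_v = sorted(visited_v)
    return cover_h, cover_v
```

For a 3x3 plus-shaped glyph, the cover consists of the middle horizontal run and the middle vertical run, which matches the intuition that a cross is described by two strokes. The recursive search is adequate for character-sized images only; a full page would call for an iterative or Hopcroft-Karp matching.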