A Generic OCR Using Deep Siamese Convolution Neural Networks

Ghada Sokar, Elsayed E. Hemayed, Mohamed Rehan

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Academic; peer-review

    5 Citations (Scopus)

    Abstract

    This paper presents a generic optical character recognition (OCR) system based on deep Siamese convolution neural networks (CNNs) and support vector machines (SVMs). Supervised deep CNNs achieve a high level of accuracy in classification tasks. However, fine-tuning a trained model for a new set of classes requires a large amount of data to overcome the problem of dataset bias. The classification accuracy of deep neural networks (DNNs) degrades when the available dataset is insufficient. Moreover, using a trained deep neural network to classify a new class requires tuning the network architecture and retraining the model. The proposed system addresses all of these limitations. The deep Siamese CNN is trained to extract discriminative features. The training is performed once, using a group of classes. The OCR system is then used to recognize different classes without retraining or fine-tuning the deep Siamese CNN model. Only a few samples are needed from any target class for classification. The proposed OCR system is evaluated on different domains: Arabic letters, Eastern-Arabic numerals, Hindu-Arabic numerals, and Farsi numerals, using test sets that contain printed and handwritten letters and numerals. The proposed system achieves promising recognition accuracy, close to the results achieved by recognition systems and CNNs trained for the specific target classes, yet without the need for retraining. The system outperforms the state-of-the-art method that uses a Siamese CNN in the one-shot classification task by around 12%.
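The workflow described in the abstract (train a feature extractor once, then recognize new classes from a few samples without retraining) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: `embed` is a hypothetical stand-in for the frozen Siamese CNN branch (it only L2-normalizes the input), the "images" are toy four-pixel lists, and a nearest-centroid rule is swapped in for the SVM classifier so that the example needs nothing beyond the standard library.

```python
import math


def embed(image):
    """Hypothetical stand-in for the frozen, pre-trained Siamese CNN branch.

    It only L2-normalizes the pixel list; a real system would run the
    trained network here. It is never retrained in this sketch.
    """
    norm = math.sqrt(sum(p * p for p in image)) or 1.0
    return [p / norm for p in image]


def fit_prototypes(support):
    """Build one mean feature vector per class from a few labeled samples.

    support: dict mapping a class label to a list of images.
    Nearest-centroid is used here in place of the SVM named in the abstract.
    """
    prototypes = {}
    for label, images in support.items():
        feats = [embed(img) for img in images]
        dim = len(feats[0])
        prototypes[label] = [
            sum(f[i] for f in feats) / len(feats) for i in range(dim)
        ]
    return prototypes


def classify(image, prototypes):
    """Return the label whose prototype is closest to the image's features."""
    feat = embed(image)
    return min(prototypes, key=lambda label: math.dist(feat, prototypes[label]))


# Two classes, two toy samples each.
support = {
    "A": [[1.0, 0.0, 0.0, 0.0], [0.9, 0.1, 0.0, 0.0]],
    "B": [[0.0, 0.0, 1.0, 0.0], [0.0, 0.1, 0.9, 0.0]],
}
prototypes = fit_prototypes(support)
print(classify([0.8, 0.2, 0.0, 0.0], prototypes))  # prints A

# A new class is added from a single sample; embed() is left untouched.
support["C"] = [[0.0, 0.0, 0.0, 1.0]]
prototypes = fit_prototypes(support)
print(classify([0.0, 0.0, 0.1, 0.9], prototypes))  # prints C
```

The point of the sketch is the division of labor: only the lightweight classifier on top of the features is rebuilt when a class is added, while the feature extractor stays fixed.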

    Original language: English
    Title of host publication: 2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2018
    Editors: Satyajit Chakrabarti, Himadri Nath Saha
    Publisher: Institute of Electrical and Electronics Engineers
    Pages: 1238-1244
    Number of pages: 7
    ISBN (Electronic): 9781538672662
    Publication status: Published - 16 Jan 2019
    Event: 9th IEEE Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2018 - Vancouver, Canada
    Duration: 1 Nov 2018 to 3 Nov 2018

    Conference

    Conference: 9th IEEE Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2018
    Country: Canada
    City: Vancouver
    Period: 1/11/18 to 3/11/18

    Keywords

    • CNN
    • DNN
    • OCR
    • Siamese
