A Generic OCR Using Deep Siamese Convolution Neural Networks

Ghada Sokar, Elsayed E. Hemayed, Mohamed Rehan

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, Academic, peer review

17 Citations (Scopus)

Abstract

This paper presents a generic optical character recognition (OCR) system based on deep Siamese convolution neural networks (CNNs) and support vector machines (SVMs). Supervised deep CNNs achieve a high level of accuracy in classification tasks. However, fine-tuning a trained model for a new set of classes requires a large amount of data to overcome the problem of dataset bias. The classification accuracy of deep neural networks (DNNs) degrades when the available dataset is insufficient. Moreover, using a trained deep neural network to classify a new class requires tuning the network architecture and retraining the model. The proposed system addresses all of these limitations. The deep Siamese CNN is trained to extract discriminative features. The training is performed once using a group of classes. The OCR system is then used to recognize different classes without retraining or fine-tuning the deep Siamese CNN model. Only a few samples are needed from any target class for classification. The proposed OCR system is evaluated on different domains: Arabic letters, Eastern-Arabic numerals, Hindu-Arabic numerals, and Farsi numerals, using test sets that contain printed and handwritten letters and numerals. Without the need for retraining, the proposed system achieves a very promising recognition accuracy, close to the results achieved by CNNs and recognition systems trained for specific target classes. The system outperforms the state-of-the-art method that uses a Siamese CNN in the one-shot classification task by around 12%.

Original language: English
Title: 2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2018
Editors: Satyajit Chakrabarti, Himadri Nath Saha
Publisher: Institute of Electrical and Electronics Engineers
Pages: 1238-1244
Number of pages: 7
ISBN (electronic): 9781538672662
Status: Published - 16 Jan 2019
Event: 9th IEEE Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2018 - Vancouver, Canada
Duration: 1 Nov 2018 - 3 Nov 2018

Conference

Conference: 9th IEEE Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2018
Country/Region: Canada
City: Vancouver
Period: 1 Nov 2018 - 3 Nov 2018
