Doorgaan naar hoofdnavigatie Doorgaan naar zoeken Ga verder naar hoofdinhoud

Integrated text detection and recognition in natural images

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

Samenvatting

Text detection and recognition in natural images have conventionally been seen in the prior art as autonomous tasks executed in a strictly sequential processing chain with limited information sharing between sub-systems. This approach is flawed because it introduces (1) redundancy in extracting the same text properties multiple times and (2) error by prohibiting verification of hard (often binarized) detection results at later stages. We explore the possibilities for integration of detection and recognition modules by a feedforward multidimensional information stream. Integration involves suitable characterization of the text string at detection and application of the knowledge to ease recognition by a given OCR system. The choice of characterization properties generally depends on the OCR system, although some of them have proven universally applicable. We show that the proposed integration measures enable more robust recognition of text in complex, unconstrained natural environments. Specifically, integration by the proposed measures (1) eliminates textual input irregularities that recognition engines cannot handle and (2) adaptively tunes the recognition stage for each input image. The former function boosts correct detections, while the latter mainly reduces the number of false positives. Our validation experiments on a set of low-quality natural images show that adaptively tuning the OCR stage to the typical text-to-background transitions in the input image (gradient significance profiling) allows to attain an improvement of 29% in the precision-recall performance, mostly through boosting precision.
Originele taal-2Engels
TitelImage Processing: Algorithms and Systems X; and Parallel Processing for Imaging Applications II, 23-25 January 2012, Burlingame, California
RedacteurenK.O. Egiazarian, S.S. Agaian, A.P. Gotchev, J. Recker, G. Wang
UitgeverijSPIE
ISBN van geprinte versie9780819489425
DOI's
StatusGepubliceerd - 2012
Evenementconference; Image Processing: Algorithms and Systems X; and Parallel Processing for Imaging Applications II; 2012-01-23; 2012-01-25 -
Duur: 23 jan. 201225 jan. 2012

Publicatie series

NaamProceedings of SPIE
Volume8295
ISSN van geprinte versie0277-786X

Congres

Congresconference; Image Processing: Algorithms and Systems X; and Parallel Processing for Imaging Applications II; 2012-01-23; 2012-01-25
Periode23/01/1225/01/12
AnderImage Processing: Algorithms and Systems X; and Parallel Processing for Imaging Applications II

Vingerafdruk

Duik in de onderzoeksthema's van 'Integrated text detection and recognition in natural images'. Samen vormen ze een unieke vingerafdruk.

Citeer dit