Abstract
This paper presents an algorithm for localization and classification of subtitles in TV videos. We extend an existing static-region detector with object-based adaptive temporal filtering, bounding box computation around blobs of refined static regions, bounding box categorization based on geometry and filling degree of static regions, and subtitle classification using text-stroke alignment features. On a test set of more than 5000 video frames, the algorithm achieves 96% precision at 98% recall. It detects subtitles without frame delay and uses techniques suitable for implementation on a TV platform. We also show experimentally that the picture quality of Motion-Compensated Picture Rate Conversion in televisions can benefit from our system.
Original language | English |
---|---|
Pages (from-to) | 274-282 |
Journal | IEEE Transactions on Consumer Electronics |
Volume | 57 |
Issue number | 1 |
Publication status | Published - 2011 |