A novel video caption detection approach using multi-frame integration

Rongrong Wang,Wanjun Jin,Lide Wu

A novel video caption detection approach using multi-frame integration

2004

Rongrong Wang
Wanjun Jin
Lide Wu

Captions in videos often play an important role in video information indexing and retrieval. In this paper, we present a novel video caption detection approach. We first apply a new multiple frame integration (MFI) method to minimize variation of the background of the image. A time-based minimum (or maximum) pixel value search is employed and a Sobel edge map is used to determine the mode of search. Then block-based text detection is performed, i.e., a small window is used to scan the image and classify as text or non-text, using Sobel edges as features. We use a two-level pyramid to detect various text sizes. Finally, we present a new iterative text line decomposition method and accurate text bounding boxes are extracted from candidate text areas. Experimental result shows that the proposed approach achieves high precision and recall.

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations