Filtering in Chinese document images based on templates and confidence measure

Chen Jiewei,Xu Weiran,Guo Jun

Filtering in Chinese document images based on templates and confidence measure

2004

Chen Jiewei
Xu Weiran
Guo Jun

A fast approach to keyword spotting in Chinese document images based on multiple templates matching and confidence measure is presented. The system generates keyword lexicon of diverse fonts and two-stage feature vectors prior to the procedure of keyword searching. A two-stage retrieval scheme and Boyer-Moore Algorithm is proposed aiming at accelerating the retrieval process. A distance measure between the candidate character and the templates is used to identify and rank similar templates. The performance of new system has been significantly improved when compared to traditional OCR and image-based approach. Experimental results confirmed the robust of the proposed approach over a wide range of degradations.

Keywords:

Computer vision
Automatic image annotation
Artificial intelligence
Information retrieval
Keyword spotting
Feature (computer vision)
Pattern recognition
Template matching
Visual Word
Feature detection (computer vision)
Feature extraction
Image retrieval
Computer science
Feature vector

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations