TY - JOUR
T1 - Hierarchical content classification and script determination for automatic document image processing
AU - Chi, Zheru
AU - Wang, Qing
AU - Siu, Wan Chi
PY - 2003/11
Y1 - 2003/11
N2 - Page segmentation and image content classification play an important role in automatic image processing with applications to mixed-type document image compression, form and check reading, and automatic mail sorting. In this paper, we first present an enhanced background thinning based approach for fast page segmentation. After the analysis of three different methods individually, a hierarchical approach for document content classification is proposed, which classifies a sub-image into one of two categories: text and halftone. Our approach combines a neural network model, cross-correlation metric, and Kolmogorov complexity measure in a hierarchical structure. Considering the necessity of a recognition system, we also propose using a three-layer feedforward neural network to classify text regions into Chinese and English scripts. The classification accuracy on a number of document images reaches 100% and 97.1% for halftone region and text region, respectively. Meanwhile, the system can achieve a correct rate of 92.3% and 95.0% for Chinese and alphabetic script determination, respectively.
AB - Page segmentation and image content classification play an important role in automatic image processing with applications to mixed-type document image compression, form and check reading, and automatic mail sorting. In this paper, we first present an enhanced background thinning based approach for fast page segmentation. After the analysis of three different methods individually, a hierarchical approach for document content classification is proposed, which classifies a sub-image into one of two categories: text and halftone. Our approach combines a neural network model, cross-correlation metric, and Kolmogorov complexity measure in a hierarchical structure. Considering the necessity of a recognition system, we also propose using a three-layer feedforward neural network to classify text regions into Chinese and English scripts. The classification accuracy on a number of document images reaches 100% and 97.1% for halftone region and text region, respectively. Meanwhile, the system can achieve a correct rate of 92.3% and 95.0% for Chinese and alphabetic script determination, respectively.
KW - Background thinning
KW - Content classification
KW - Cross-correlation
KW - Document image processing
KW - Kolmogorov complexity
KW - Neural networks
KW - Page segmentation
KW - Script determination
UR - http://www.scopus.com/inward/record.url?scp=0141863195&partnerID=8YFLogxK
U2 - 10.1016/S0031-3203(03)00128-6
DO - 10.1016/S0031-3203(03)00128-6
M3 - 文章
AN - SCOPUS:0141863195
SN - 0031-3203
VL - 36
SP - 2483
EP - 2500
JO - Pattern Recognition
JF - Pattern Recognition
IS - 11
ER -