TY - JOUR
T1 - Script determination of mixed Chinese/English document images using Kolmogorov complexity measure
AU - Chi, Zheru
AU - Wang, Qing
PY - 2002
Y1 - 2002
N2 - In this paper, we propose an approach based on Kolmogorov Complexity (KC) measure for determining script classes in mixed Chinese (complex characters)/English document images. This approach, which mainly consists of two steps: document image preprocessing and KC measure, can successfully separate Chinese text lines from English ones. Our approach is robust and reliable in handling document images of different appearances and densities, and various fonts, sizes and styles of characters used in documents. Experimental results on a set of 40 text line images (20 English text lines and 20 Complex Chinese text lines) from various document images show that 100% correct classification rate can be achieved.
AB - In this paper, we propose an approach based on Kolmogorov Complexity (KC) measure for determining script classes in mixed Chinese (complex characters)/English document images. This approach, which mainly consists of two steps: document image preprocessing and KC measure, can successfully separate Chinese text lines from English ones. Our approach is robust and reliable in handling document images of different appearances and densities, and various fonts, sizes and styles of characters used in documents. Experimental results on a set of 40 text line images (20 English text lines and 20 Complex Chinese text lines) from various document images show that 100% correct classification rate can be achieved.
KW - Document image processing
KW - Kolmogorov complexity
KW - Scrip determination
UR - http://www.scopus.com/inward/record.url?scp=0036450080&partnerID=8YFLogxK
U2 - 10.1117/12.477053
DO - 10.1117/12.477053
M3 - 会议文章
AN - SCOPUS:0036450080
SN - 0277-786X
VL - 4875
SP - 686
EP - 692
JO - Proceedings of SPIE - The International Society for Optical Engineering
JF - Proceedings of SPIE - The International Society for Optical Engineering
IS - 2
T2 - Second International Conference on Image and Graphics
Y2 - 16 August 2002 through 18 August 2002
ER -