In this paper, we propose a new method for segmenting characters in hangul document mixed with alphanumeric characters and picture. Since hangul has structural characteristics different from those of alphanumeric characters, structural characteristics...
In this paper, we propose a new method for segmenting characters in hangul document mixed with alphanumeric characters and picture. Since hangul has structural characteristics different from those of alphanumeric characters, structural characteristics of hangul characters are also different from those of alphanumeric ones. If hangul and alphanumeric characters are both written in a document, it is difficult to know whether the touching characters are hangul or not.
The proposed segmentation method uses an MLP to generate candidate cutting points.
The MLP-based segment lea군 cutting points from training samples which are composed of features extracted from touching character images and correct cutting point of those images. It generates five candidate cutting points per a touching character image and each candidate has a value that is regarded as cutting possibility at that position.